PROPAGATION OF CHAOS FOR RANK-BASED INTERACTING 
DIFFUSIONS AND LONG TIME BEHAVIOUR OF A SCALAR 
QUASILINEAR PARABOLIC EQUATION 



BENJAMIN JOURDAIN AND JULIEN REYGNER 



Abstract. We study a quasilinear parabolic Cauchy problem with a cumulative distribution 
function on the real line as an initial condition. We call 'probabilistic solution' a weak solution 
which remains a cumulative distribution function at all times. We prove the uniqueness of 
such a solution and we deduce the existence from a propagation of chaos result on a system 
of scalar diffusion processes, the interactions of which only depend on their ranking. We then 
investigate the long time behaviour of the solution. Using a probabilistic argument and under 
weak assumptions, we show that the flow of the Wasserstein distance between two solutions 
is contractive. Under more stringent conditions ensuring the regularity of the probabilistic 
solutions, we finally derive an explicit formula for time derivative of the flow and we deduce the 
convergence of solutions to equilibrium. 



Introduction 

Let a,b : [0,1] -> 1 be continuous functions, with a > 0. For all u <G [0,1], let us define 
A(u) = J Q a(v)dv and B(u) = J Q b(v)dv. Let m be a probability distribution on K. We are 
interested in the nonlinear Cauchy problem on [0, +oo) x R: 




- l -d 2 x (A{F t {x)j)-d x (B{F t {x))), 

where H * ■ refers to the spatial convolution with the Heaviside function. 

The partial differential equation in (1) is called a scalar quasilinear parabolic equation. It is a 
model for several usual nonlinear evolution equations, such as the porous medium equation, for 
which B{u) = and the diffusion term has the particular form A(u) = u q , q > 1; or conservation 
laws, in which the diffusion term is linear, i.e. A(u) = a 2 u with a 2 > 0. A conservation law is 
said to be viscous if o~ 2 > and inviscid if a 2 — 0. A particular case of a conservation law is the 
Burgers equation, for which B(u) = u . 

In this article, we introduce a probabilistic approximation of the Cauchy problem by means 
of a system of scalar diffusion processes, interacting through their ranking. We then use this 
probabilistic representation to study the long time behaviour of the solution. 

A weak solution to the Cauchy problem (1) is a continuous mapping F : t € [0, +oo) > F t £ 
Ll oc {R) such that for all t > 0, F t takes its values in [0, 1] and for all g e C c °°([0, +oo) x R), 

g(t, x)F t {x)dx — I <?(0, x)H * m(x)dx 
1 



(2) 



- A(F s (x))d 2 g(s, x) + B(F S (x))d x g(s, x) + F s (x)d s g{s, x) j dsdx, 

where C£°([0, +oo) x R) refers to the space of real- valued C°° functions with compact support in 
[0,+oo) x R. 
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When A is C 2 on [0, 1], we say that F has the classical regularity if F is C 1,2 on (0, +oo) x M 
and solves (1) in the classical sense. The space derivative p of a solution F with classical regularity 
satisfies the nonlinear Fokker-Planck equation 

(3) d tPt (x) = l -d 2 x (a{H *p t {x))vt{x)) -d x (b(H*p t (x))p t (x)), 

therefore it is natural to consider the associated nonlinear stochastic differential equation 

ix t =X + J^b(H* P s (X s ))ds + j <*{H* P s (X s ))dW s , 
[ P t is the distribution of X t , 

where o~(u) := a(it) 1 / 2 , X has distribution m and is independent of the Brownian motion W. 
Due to the discontinuity of the Heaviside function, a direct study of this equation by classical 
techniques, such as the use of fixed-point theorems [37], seems out of reach (except when the 
diffusion coefficient a is constant, see [23]); therefore we introduce a linearized approximation 
of (4). 

In Section f , we call particle system a solution X n £ C([0, +oo), R") to the stochastic differential 
equation in R™ 

Xl' n = X* + f b(H * MlW n ))ds + f (c + a(H * tfPtf"))) dW l s , 
Jo Jo 

n 

n L — ' * 

i=l 

where (Xp)i>i is a sequence of i.i.d. random variables with marginal distribution to, independent 
of the R™ -valued Brownian motion (W 1 , . . . , W n ). Here, the term c n > has been added in the 
diffusion in order to ensure the well definition of solutions, and it is only required to vanish when 
n — > +oo. In Proposition 1.1, we prove that as soon as the function A is increasing and m has a 
finite first order moment, the flow of time marginals t <— > /i™ of the empirical distribution of the 
particle system converges in probability to the unique mapping t t— > P(t) such that the function 
F : (t,x) H> (H * P(t))(x) is a weak solution to the Cauchy problem (1). This function will be 
referred to as the probabilistic solution of the Cauchy problem. Our analysis is essentially based on 
results obtained for the particular case of the porous medium equation [25]. A crucial argument 
in the extension of this work is the uniqueness of weak solutions to the Cauchy problem stated in 
Proposition 1.2, the proof of which is adaptated from works by Wu, Zhao, Yin and Lin [41] and 
Liu and Wang [31] (see Appendix A). Proposition 1.1 has strong connections with recent results 
by Shkolnikov [35], see Remark 1.15. 

We then give two representations of the mapping t i— > P(t) as the flow of time marginals of 
a probability distribution on the space of sample-paths C([0, +oo),R). More precisely, if F is 
continuous on (0, +oo) x R, we give necessary and sufficient conditions on the coefficients of the 
Cauchy problem for the empirical distribution fi n = (l/n) Yh=i ^x^" of the particle system in 
C([0, +oo), R) to converge in probability to the law P of a weak solution to (4) in Subsection 1.2. 
Without assuming the continuity of F, we define in Subsection 1.3 the reordered particle system as 
the reflected diffusion process obtained by increasingly reordering the positions (X t '", . . . , X™' n ) of 
the particles. We prove in Proposition 1.14 that the associated empirical distribution jl n converges 
in probability to a probability distribution P on C([0, +oo),R) with time marginals P(t). 

Our motivation for introducing the particle system (5) is the study of nonlinear evolution prob- 
lems, as it has been done for particular equations by Jourdain [24, 25, 26]. However, such systems of 
so-called rank-based interacting particles also arise in several contexts (see the introduction of [21] 
for references), and have received much attention lately. In particular, motivated by the study 
of the Atlas model of equity markets introduced by Fernholz [15] (see also Banner, Fernholz and 
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Karatzas [4]), much work has been done about the rank-based stochastic differential equation 

n n 

(6) dX\ = £ l^ytybjdt + £ l {X]=Y i } a 3 &Wl 

3=1 J'=l 

where (l^ 1 , . . . , Y™) refers to the increasing reordering of (X^, . . . , X"). The case n = 2 is exhaus- 
tively studied by Fernholz, Ichiba, Karatzas and Prokaj [14]. For n > 3, Ichiba, Karatzas and 
Shkolnikov [19] show that strong solutions can be defined as long as there is no triple collision. 
Triple collisions are studied in [20] . Concentration of measure bounds for the local time at collisions 
and statistics related to this system are given by Pal and Shkolnikov in [32]. 

As far as the long time behaviour of solutions to (6) is concerned, Ichiba, Papathanakos, Banner, 
Karatzas and Fernholz [22] prove that under some convexity assumption on the sequence of drift 
coefficients (bj), the process of spacings (Y t 2 — Y^,..., Y t n — Y" ) converges in total variation to 
its unique stationary distribution when t — > +oo. When the sequence of diffusion coefficients is 
such that erf — a\ = • ■ ■ = of, — o^_y, this stationary distribution is the product of exponential 
distributions. These results extend the work by Pal and Pitman [33], in which tjj = 1 for all 
j. In this case, the particle system solution to (6) does not have any equilibrium, as the process 
of its center of mass is a drifted Brownian motion. However, the convergence to equilibrium in 
total variation of its projection on the hyperplane {x\ + • • • 4- x n = 0} can be deduced from the 
long time behaviour of the process of spacings [33]. Convergence rates are provided by Ichiba, 
Pal and Shkolnikov [21] using Lyapounov functionals. Based on the Poincare inequality satisfied 
by the stationary distribution, Jourdain and Malrieu [28] prove the convergence to equilibrium 
in x 2 distance with an exponential rate, which is uniform in n. However, due to the lack of 
scaling property in the dimension n for the x 2 distance, one cannot deduce from their result the 
convergence to equilibrium of the probabilistic solution Ft to the Cauchy problem (1), which is the 
purpose of Sections 2 and 3 of this article. 

In many cases, transport metrics, and the Wasserstein distance in particular, are contractive for 
the flow of solutions to parabolic equations: see von Renesse and Sturm [40] for the linear Fokker- 
Planck equation, Carrillo, McCann and Villani [11], Cattiaux, Guillin and Malrieu [13] and the 
recent work by Bolley, Gentil and Guillin [7] for the granular media equation and Bolley, Guillin 
and Malrieu [8] for the kinetic Vlasov-Fokker-Planck equation. We will prove such a contractivity 
property by a probabilistic argument and without further regularity assumption, and then take 
advantage of it to state the convergence to equilibrium of the solutions. 

Let us first recall some useful properties of the one-dimensional Wasserstein distance (see Vil- 
lani [39] for a complete introduction). Let p > 1. For all probability distributions \i and v on R, 
we define 

W p (fx,y):= inf E (\X - Y^) 1 ^ , 

where n(^, v) refers to the set of random couples (X, Y) such that X has marginal distribution \i 
and Y has marginal distribution v. As soon as both /i and v have a finite moment of order p, then 
W p (fi, v) < +oo. 

Given a right-continuous nondecreasing function F, we define its pseudo-inverse as F~ 1 (u) := 
inf{.T G K : F(x) > u}. Then it is a remarkable feature of the one-dimensional case that the 
Wasserstein distance W p (fi, v) can be expressed in terms of the pseudo-inverses of the cumulative 
distribution functions Fp := H * p, and F v := H * v as 

(7) W*M = [ \F-\u) F~ 1 (u)\ p du. 

Jo 

This leads to the following useful expressions: let (xi, . . . , x n ) £ R™, we denote by (j/i, . . . , y n ) its 
increasing reordering and by f/ 1 its empirical distribution. Then for all probability distribution 



(8) W*(ji n ,n) = ]T / \ Vl - F-^u^du < £ / la* - F-\u)\*du. 



l)/n 



4 



BENJAMIN JOURDAIN AND JULIEN REYGNER 



In particular when /i is the empirical distribution fx' n of some vector (a/ 1; . .. ,x n ) with increasing 
reordering (y[, . . . , y' n ), 



We finally point out the fact that we will indifferently refer to the Wasserstein distance between /z 
and v as W p (^i 7 v) or W P (F I1 , F v ). 

In Section 2 we study the evolution of the Wasserstein distance between two probabilistic so- 
lutions F and G of the Cauchy problem (1) with different initial conditions Fq and Go- We 
use the contractivity of the reordered particle system in Proposition 2.1 to prove that the flow 
t i— > W p (Ft, Gt) is nonincreasing. Then we provide an explicit expression of the time derivative 
of the flow in Proposition 2.3. Our work is related to results exposed in the review papers by 
Carrillo and Toscani [10] and Carrillo, Di Francesco and Lattanzio [!)]; a further review is given in 
Remark 2.5. 

Section 3 is dedicated to the convergence to equilibrium of the solutions. We call stationary 
solution the cumulative function F^ of a probability distribution m,^ with a finite first order 
moment, such that if F is the probabilistic solution of the Cauchy problem (1) with Fq = Foo, 
then for all t > 0, F t = F^. It is clear from (2) that F^ is a stationary solution if and only if it 
solves the stationary equation (l/2)d 2 {A{F aD )) — d(B(Foo)) = 0, in the sense of distributions. 

We solve the stationary equation in Proposition 3.1, extending the results of Jourdain and 
Malrieu [28] who deal with the viscous conservation law. Using the results of Section 2 as well 
as the probabilistic approximation built in Section 1, we then prove in Theorem 3.6 that the 
probabilistic solutions converge to the stationary solutions in Wasserstein distance. 

In [28] , the solutions of the viscous conservation law are proven to converge exponentially fast 
to equilibrium in \ 2 distance, under the condition that the initial measure m be close enough to 
the stationary solution m^. Theorem 3.6 does not involve such a condition, but the proof does 
not provide any indication on the rate of convergence of Ft to the equilibrium. As we remark 
in Subsection 3.3, one can recover an exponential rate of convergence in (quadratic) Wasserstein 
distance in the setting of [28]. 

Notations. Given a separable metric space S, we denote by V(S) the set of Borel probability 
distributions on S, equipped with the topology of weak convergence. The space C([0, +oo), S) of 
continuous functions from [0, +oo) to S is provided with the topology of the uniform convergence 
on the compact sets of [0, +oo). Besides, for all probability distribution /i <G "P(C([0, +oo), S)), 
the marginal distribution at time t > is denoted by \x t £ ViS). The canonical application 
"P(C([0, +oo), S)) — > C([0, +oo), 'P(S')) associates the distribution [i with the flow of its time 
marginals 1 1— >• fi t . Finally, if / is a real- valued bounded function then ||/||oo refers to the supremum 
of the function |/|. 

Assumptions. Our results are valid under various assumptions on the degeneracy of the parabolic 
equation. Let us introduce the following conditions: 

(Dl) The function A is increasing. 
(D2) For all u G (0, 1], a(u) > 0. 

(D3) There exists a > such that, for all u £ [0, 1], a(u) > a. 

Obviously (Dl) is weaker than (D2), which is weaker than (D3). 

We also introduce the two following conditions on the regularity of the coefficients: 

(Rl) The function a is C 1 on [0, 1]. 

(R2) The function a is C 2 on [0, 1], the function b is C 1 on [0, 1] and there exists f3 > such 
that the functions a" and b' are /3-H61der continuous. 

The condition (Rl) is a natural necessary condition for the Cauchy problem (1) to admit classical 
solutions. The stronger condition (R2) will be used in Lemma 1.6 to ensure the existence of classical 
solutions to the Fokker-Planck equation (3). 



(9) 




i=l 



i=l 
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Finally, the existence and integrability of stationary solutions will depend on the two following 
equilibrium conditions: 

(El) For all u G (0, 1), B(u) > 0, B(l) = and the function a/2B is locally integrable on (0, 1). 
(E2) The function a/2B is such that 

1/2 a(u)u , f 1 a(u)(l-u) , 

du+ N , dzt < +oo. 



2|B(u)| A/2 2|B(u)| 



1. Probabilistic approximation of the solution 

1.1. Existence and uniqueness of the probabilistic solution. Following [5], for all n > 1 
there exists a unique weak solution A™ — (X t ' n , . . . , A™'™) t >o to the stochastic differential equa- 
tion (5). We call it the particle system and denote by //" the random variable in P(C([0, +oo),R)) 
defined by /it™ := (l/n) Yh=i S x'."- 

Let T > 0, possibly T = +oo. We denote by ViiT) the set of continuous mappings t G [0,T) i-> 
P(i) G P(R) such that for all t G [0,T), the probability distribution Pit) has a finite first order 
moment and the function t i— > J R |x|P(t)(da;) is locally integrable on [0,T). Let -F(T) := {P : 
i — ^ (H * P(t))(x);P G Pi(T)}; note that T(T) C C([0,T), Lj oc (E)). The particular sets 
Pi(+oo) and J r (+oo) are simply denoted by Pi and J ' . 

Throughout this article, we will call probabilistic solution the solution to the Cauchy problem (1) 
given by the following proposition. 

Proposition 1.1. Under the nondegeneracy condition (Dl) and the assumption that m has a 
finite first order moment, there exists a unique weak solution F to the Cauchy problem (1) in T ' , 
and it writes F : it, x) n- iH * P(t))(x) where the mapping 1 1— > Pit) is the limit in probability, in 
C([0, +oo), P(R)) 7 of the sequence of mappings (4 /i™. 

The proof of Proposition 1.1 relies on Proposition 1.2 and Lemmas 1.3 and 1.4. For all n > 1, 
let 7r' n denote the distribution of /i™ in P(C([0, +oo), R)). In Lemma 1.3, we prove that the 
sequence (7r' n ) n >i is tight. Since the canonical application P(C([0, +oo), R)) — > C([0, +oo), P(R)) 
is continuous, then the sequence i~K n ) n >i of the distributions of the random mappings t \-¥ /i™ in 
C([0, +oo), P(R)) is tight. Let ir°° be the limit of a converging subsequence, that we still index 
by n for convenience. Lemma 1.4 combined with Proposition 1.2 proves that 7r°° concentrates on 
a single point P G Pi, which is such that the function (t,x) iH * P(t))(x) is a weak solution 
of (1). 

Proposition 1.2. Assume that the nondegeneracy condition (Dl) holds. 

(1) LetT> 0, possibly T = +oo. Let F 1 and F 2 G T(T), such that for all g G C c °°([0, T) x R), 
F 1 and F 2 satisfy (2). Then F 1 = F 2 in T{T). 

(2) There is at most one weak solution to the Cauchy problem (1) in J- ' . 

Proof. The second point of the proposition clearly follows from the first point, and the first point 
is proved in Appendix A. □ 

Lemma 1.3. The sequence (7r' n ) n >i is tight. 

Proof. Since the distribution of (X . . . , X n ' n ) in C([0, +oo), R") is symmetric, according to 
Sznitman [37, Proposition 2.2, p. 177], (7r'™)„>i is tight if and only if the sequence of the distri- 
butions of the variables X 1 '™ G C([0, +oo),R) is tight. This latter fact classically follows from the 
fact that for all n > 1, Xq' 71 = Aq has distribution m on the one hand, and from the Kolmogorov 
criterion as well as the boundedness of the coefficients a and b and the sequence (c n )„>i on the 
other hand. □ 



Lemma 1.4. Under the assumption that m has a finite first order moment, the distribution tt°° 
is concentrated on the set of mappings P G Pi such that the function (t, x) i— > iH * P[t)){x) is a 
weak solution to the Cauchy problem (1). 
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Proof. We first prove that tt°° concentrates on V\. Let be a variable in C([0, +00), "P(R)) with 
distribution 7r°°. We will prove that for all t > 0, 



sup / \x\/j,°° (s)(dx) < +00 a.s., 
se[o,t] Jr 

so that taking t in a countable unbounded subset of [0, +00) yields £ V\ almost surely. 

Let t > 0. For all M > 0, the function Jm : fi i-> sup sg j o t i / K (|a:| A M)/j,(s)(dx) is continuous 
and bounded on C([0, +00), T-'(R)). For fixed n, 



n i=i Ve[o,t] / 

. 1 n 

< / |a:|m(da:)+t||6|| 00 + - V 



E sup 

\se[o,t] 



, 1/2 



where we have used the Cauchy-Schwarz inequality in the second line and the Doob inequality as 
well as the fact that m has a finite first order moment in the third line. The constant C depends 
neither on M nor on n. As a consequence, liminf j\/->+oo E(fM(/j,°°)) < C and by Fatou's lemma, 

C > E ( liminf /m(m°°) ) > E ( sup liminf / (b| A M)/i°° (s) (dx) ) . 
y M ^+oo ) \ S £[o,t] M ^+°° Jr J 

By the monotone convergence theorem, 



liminf / (\x\ AM)fi°°{s)(dx) = lim / (\x\ A M)fi°° (s)(dx) = / |a; | (s) (dx) , 

so that E ^sup se [ t ] J R |x|^°°(s)(da;)^ < C, which yields the expected result. 

It now remains to prove that (t,x) h-> (H * /j,°°(t))(x) is almost surely a weak solution of the 
Cauchy problem (1). The computation is made for the porous medium equation in [2- r ), Lemma 
1.5] and can be straightforwardly extended; it relies on the uniform continuity of a and b on [0, 1] 
and the fact that c„ — > 0. □ 

Remark 1.5. Without assuming neither (Dl) nor the existence of a first order moment for m, one 
can still prove that the sequence (7r™)„>i is tight and that the limit of any converging subsequence 
concentrates on weak solutions to the Cauchy problem (1), that of course not necessary belong to 
J- . Thus, the existence of weak solutions holds under very weak assumptions. 

We conclude this subsection by discussing the regularity of the probabilistic solution F. For all 
finite T > 0, we denote by C'l' 2 {[0, T] x R) the set of C 1 - 2 functions on [0, T]xR that are bounded 
together with their derivatives. For all I > 0, the Holder spaces H l (R) and H l / 2 > 1 ([0, T] x R) are 
defined as in [30, p. 7]. 

Lemma 1.6. Assume that the uniform ellipticity condition (D3) and the regularity condition (R2) 
hold, that m has a finite first order moment and that H * m is in the Holder space H l (R), with 
I = 3 + f3. Then for all finite T > 0, the probabilistic solution F to (1) is in ([0,T] x R). In 
particular, it is a classical solution to (1). 

Proof. Fix a finite T > 0. Then owing to the assumptions (D3), (R2) and on the regularity of 
H * m. the classical result of Ladyzenskaja, Solonnikov and Ural'ceva [30, Theorem 8.1, p. 495] 
ensures that the Cauchy problem in divergence form 



(10) 



( l -a{F t {x))d x F t {x) - B{F t {x)) 



F (x) = H * m(x), 

admits a classical bounded solution F, which belongs to the Holder space #'/ 2 ' Z ([0,T] x R), with 
I = 3 + (3. Certainly, F satisfies (2) for all g £ C c °°([0,T) x R). Let us now prove that F £ T(T). 
On the one hand, by the maximum principle [30, Theorem 2.5, p. 18], for all t £ [0,T], ||-Ft||oo < 1- 
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On the other hand, the space derivative p := d x F is C 1 ' 2 on [0, T] x R and satisfies the linear 
parabolic equation 

d t pt(x) = a(t, x)dlp t {x) + b(t, x)d x p t {x) + c(t, x)p t (x), 

where 

a(t,x) := -a(F t (x)), 

b(t,x) := ^a'(F t (x))d x F t (x) - b(F t (x)), 

c(t,x) := ±a"(F t (x))(d x F t (x)) 2 - b'(F t (x))d x F t (x). 

The coefficients a, b and c are continuous and bounded in [0, TjxR and, due to the condition (D3), 
the operator is parabolic. By the maximum principle [17, Theorem 9, p. 43], and since po > 0, 
then p t {x) > for all (t, x) £ [0, T] x R. 

As a consequence, for all t £ [0,T], pt is the density of a nonnegative bounded measure on 
R, with total mass lower than 1. Let us now prove that the mapping t i— > pt{x)dx is continuous 
for the topology of weak convergence. Since p is continuous on [0,T] x R, the mapping t n- 
pt(x)dx is continuous for the topology of vague convergence. Besides, as, for all s,t £ [0, T], 
sup xgR |.Ft(x) — -^(x)! < | IcJti 71 ! |oo I* — s\, the total mass 1 1-> J" R p t (x)dx is continuous. 

Hence, the continuous mapping i n- p t (x)dx is a measure- valued solution to the linear Fokker- 
Planck equation 

dm = l -d 2 x (a{F t {x))^ t ) - d x {b(F t (x))nt), 

the coefficients of which are measurable and bounded functions on [0, T] x R. Therefore by Fi- 
galli [16, Theorem 2.6], and since po is a probability density, then for all t £ [0, T], p~ t is the density 
of the distribution of X t , where (X t ) te r 0] Tl is a weak solution to the stochastic differential equation 

X t =X + I b(F s (X s ))ds+ f a(F s (X s ))dW s , 
Jo Jo 

where X has distribution m and is independent of the Brownian motion W. Now one easily 
deduces from the assumption that m has a finite first order moment and from the boundedness of 
a and b that F £ J-(T). Therefore, by the first part of Proposition 1.2, F is the restriction to [0, T] 
of the probabilistic solution F to (1) given by Proposition 1.1. Hence, F £ C£' ([0, T] x R) and the 
fact that F is a classical solution to (1) now follows from the fact that T is arbitrarily large. □ 

Remark 1.7. The regularity assumption on the initial condition H *m is far from being necessary 
for the probabilistic solution F to have the classical regularity. For instance, it is known for the 
case of the viscous conservation law that F has the classical regularity even for a discontinuous 
initial condition H * m (see [28, Corollary 1.2]). 

1.2. The nonlinear martingale problem. The propagation of chaos result of Proposition 1.1 
only deals with the flow of time marginals of the empirical distribution fj, n . A natural further ques- 
tion is the convergence in V(C([0, +oo), R)) towards the solution to a proper nonlinear martingale 
problem. 

Recall that the distribution of the random variable pL n in "P(C([0, +oo), R)) is denoted by 7r' n , 
and by Lemma 1.3, it is tight. Let X refer to the canonical process on the probability space 
C([0,+oo),R), namely X t (u) := w t for all u € C([0, +oo), R). We shall also denote by Cg(R) the 
space of C 2 functions <j> : R — > R such that </>, <jjf and </>" are bounded. 

Definition 1.8. A probability distribution P £ "P(C([0, +oo), R)) is called a solution to the non- 
linear martingale problem if: 

• Pq = m; 

• for all <j) £ C 2 (R), the process defined by Aff := <j){X t ) - 4>{X Q ) - J* L(P s )(j)(X s )ds 
is a P -martingale, where, for all fi £ "P(R) ; L(fi)(j)(x) := b(H * /i(x))^>'(x) + {l/2)a{H * 
/i(x))0"(x); 

• for almost all t > 0, P t does not weight points. 
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Following [25, Lemma 1.2], if P is a solution to the nonlinear martingale problem, then the func- 
tion (t,x) i — y H*P t (x) is a weak solution to the Cauchy problem (1). Besides, since the coefficients 
a and b are bounded, it is easily seen that if m has a finite first order moment, then this solution 
belongs to J 7 , therefore it coincides with the probabilistic solution F given by Proposition 1.1. 

Owing to Levy's characterization of the Brownian motion, a probability distribution solving 
the nonlinear martingale problem is the distribution of a weak solution to the nonlinear stochastic 
differential equation (4). Reciprocally, the distribution P of a weak solution to (4) is a solution 
to the nonlinear martingale problem if and only if for almost all t > 0, P t does not weight points. 
When there exists a unique solution to the nonlinear martingale problem, we will refer to the 
associated weak solution X of (4) as the nonlinear diffusion process. 

Let us first investigate the existence of a solution to the nonlinear martingale problem. 

Lemma 1.9. Under the nondegeneracy condition (D2) and the assumption that m has a finite 
first order moment, the limit of any converging subsequence o/(7r'") Il >i concentrates on the set of 
solutions to the nonlinear martingale problem. 

Proof. By Lemma 1.3, the sequence (%' n ) n >i is tight. Let 7r'°° denote the limit of a converging 
subsequence, that we still index by n for convenience. Let Q refer to the canonical variable in the 
probability space "P(C([0, +oo), R)). Since the variables Xq are i.i.d. with marginal distribution 
m, then 7r'°°-a.s., Qo = m. Let us now prove that 7r'°°-a.s., for almost all t > 0, Qt does not 
weight points. By Proposition 1.1, 7r'°°-a.s., for all t > one has H * Q t = F t where F is the 
probabilistic solution to the Cauchy problem (1). Therefore it is enough to prove that, for almost 
all t > 0, the function Ft is continuous on R. In this purpose, we first remark that the mapping 
t i-> P(t) solves a linear Fokker-Planck equation. Indeed, since A and B are C 1 on [0,1], the 
functions (t,x) h-> A(Ft(x)), (t,x) i-> B(Ft(x)) are of finite variation and the associated Stieltjes 
measures write d(A(F t (x))) = a(t,x)P(t)(dx), d(B(F t (x))) = b(t, x)P(t)(dx), where 

( a{F t {x)) if Ft is continuous in x, 

a(t,x) := J A(F t (x)) - A(F t (x-)) 



F t (x)-F t (x-) 



otherwise, 



and b(t, x) is similarly defined. Remark that the functions a and b are bounded, and ||a| 
IHloo < IHloo- As a consequence, the continuous mapping t H> P{t) is a measure-valued solution 
on [0, +oo) to the Fokker-Planck equation with measurable and bounded coefficients 

dtP(t) = l -d 2 x {a{t lX )P{t)) - 8 x (b(t,x)P(t)). 

Fix a finite T > 0. Then by Figalli [16, Theorem 2.6], there exists a probability distribution 
P G V(C([0,T],R)) such that, for all t € [0,T], P t = P(t) and P is the distribution of a weak 
solution (Xt) t £[o,T] on [0,T] to the stochastic differential equation 

t pt 



Xt = X + f b(s,X s )ds+ f a{s,X s )dW s , 
Jo Jo 



where a(t, x) := a{t, a;) 1 / 2 . The process (X t ) t ^[o,T] satisfies the condition of Bogachev, Krylov and 
Rockner [(>, Remark 2.2.3, p. 63]. Hence the positive measure a(t, x)Pt(dx)dt admits a density 
p(t,x) € Lj 2 oc ([0,T] x R). Now for all t € [0,T], F t is the cumulative distribution function of X t so 
that Ft(x) > 0, Pt-almost everywhere. By (D2) and the definition of er, we deduce that JVahnost 
everywhere, a(t, x) > 0, therefore 

p(t,x) 

P t (dx)dt = l{ S (t,x)>o} -u \ dxdt 
a(t, x) 

and consequently, P t {dx) admits a density for almost every t <G [0, T]. We conclude by taking T 
arbitrarily large. 

We finally prove that 7r'°°-a.s., for all cj) e C^(R), the process defined by hlf := <f>(X t ) - 
<fi(X ) — J* L(Q s )<j)(X s )ds is a Q-martingale. We will proceed as in the proof of [25, Lemma 1.6]. 
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Let <j) G Cb(K), k > 1, < si < • • • < Sfc < s < t and g : R fe -» R continuous and bounded. For all 
Q G P(C([0,+oo),R)), we define 

G(Q) := (Q, g(X Sl ,...,X Sk ) U(X t ) - 4>(X S ) - J L(Q r )0(X r )dr 

By Ito's formula, for all n > 1, 

1 " / r* 



so that, since (c„)„>i, <r, g, <^>' and 4>" are bounded, lim„_,. +00 E(t/(/i™) 2 ) = 0. We now check that 
the functional Q is continuous at all P G P(C([0, +oo),R)) such that, for almost all r > 0, P r does 
not weight points. Let {P q ) q >\ be a sequence of probability distributions on C([0, +00), R) weakly 
converging to P G P(C([0, +oo), R)) such that, for almost all r > 0, P r does not weight points. 
Then, for all q > 1, 

SOP 9 ) = (p 9 , fl(X Sl ,...,A S J Al(P«) - L(P r ))0(X r )dr 
(11) X Js t 

+ (p\g(X Sl ,...,X Sk )(cf ) (X t )-cj > (X s )- [ L(P r )cb(X r )dr 



On the one hand, as g and the derivatives of </> are bounded, there exists C > independent of q 
such that, for all q > 1, 

P«, g(X Sl , . . . , X Sk ) j\h{P?) - L(P r ))d>(X r )dr 

< C [ fsup * P?{x)) - b(H * P r (x))\ + sup \a(H * P?{x)) - a(H * P r (x))\] dr. 

By Dini's theorem, for almost all r E [s,t], H * P^ converges uniformly to H * P r on R when 
q — > +oo. As the functions b and a are bounded and uniformly continuous on [0, 1], by Lebesgue's 
theorem, the right-hand side above goes to when q — > +oo. 

On the other hand, for almost all r £ [s,t], the function x i-> L{P r )4>(x) is continuous on 
R and uniformly bounded in r, therefore by Lebesgue's theorem again, the function (a; r ) r >o ^ 
g{x Sl , . . . , x Sk )(4>{x t ) — <p(x s ) ~ f* L(P r )<f>(x r )dr) is continuous on C([0, +oo), R). As a conse- 
quence, the second term in the right-hand side of (11) converges to Q{P) and we conclude that 

iim«^ +00 s(p<o =g(p). 

Since we have proved that 7r ,00 -a.s., for almost all r > 0, Q r does not weight points, then 

lim W' n {g{Qf) = E^°(£(Q) 2 ), 

n— >+oo 

which rewrites 

E"'~(£(Q) 2 ) = lim E(£(//*) 2 ) = 0. 

n— >+oo 

As a consequence, taking 0, (si, . . . , s&, s, t), g in countable subsets leads to the conclusion that 
7r ,00 -a.s., Q solves the nonlinear martingale problem. □ 

We now address the uniqueness of solutions to the nonlinear martingale problem. The following 
criterion is due to Stroock and Varadhan [36]. 

Lemma 1.10. Under the assumptions of Proposition 1.1, if the function (t,x) ^ a(Ft(x)) is 
uniformly positive on the compact sets of [0, +oo) x R, then there is at most one solution to the 
nonlinear martingale problem. 

Proof. Let P and Q denote two solutions to the nonlinear martingale problem. Then they both 
solve the following linear martingale problem in R E P(C([0, +oo),R)): 

• Rq = m; 
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• for all 4> g Cg(R), the process defined by 

Mf := <t>{X t ) - tf>(X ) - J ^'(X s )b(F s (X s )) + ±<j>"(X s )a(F s (X s ))\ ds 
is a i?-martingale. 

The functions a(F s (x)) and b(F s (x)) are measurable and bounded, and a(F s (x)) is uniformly 
positive on the compact sets of [0, +00) x R. By [36, Exercise 7.3.3, p. 192], P = Q. □ 

Lemma 1.11. Assume that m has a finite first order moment, and either the uniform ellipticity 
condition (D3) holds, or the nondegeneracy condition (D2) holds and Fq(x) > for all x g R. 
Then the function (t,x) 1— > a(F t (x)) is uniformly positive on the compact sets of [0, +00) X R. 

Proof. If (D3) holds, the result is obvious. Now if Fq(x) > 0, for all compact subset K g [0, +00) x 
R, by the first part of Lemma B.l in Appendix B, there exists iio > such that for all (t, x) g K, 
F t (x) > u . If (D2) holds in addition, for all (t, x) g K, a{F t {x)) > inf u >„ a(u) > 0. □ 

We conclude this subsection by proving a propagation of chaos result for the empirical distri- 
bution fi n of the particle system in "P(C([0, +00), R)). 

Corollary 1.12. Under the assumptions of Lemma 1.11, there exists a unique solution P to 
the nonlinear martingale problem, and it is the limit in probability, in P(C([0, +00), R)), of the 
sequence of empirical distributions \x n . 

1.3. The reordered particle system. For all t > 0, let Y t n := (Y t ' n , . . . , Y™ ,n ) denote the 
increasing reordering of the vector (X t ' n , . . . , X™' n ). Then the sample-paths of the process Y n are 
in C([0, +00), D n ), where D n refers to the polyhedron {(yi, . . ■ , y n ) € R" : Vi < ■ ■ ■ < Dn}- 

It is well known that Y n is a normally reflected Brownian motion on dD n , with constant 
drift vector and constant diagonal diffusion matrix. More precisely, according to [25], the process 
(/3 1 , . . . ,f3 n ) defined by (3\ = 5Zj=i Jo ^-{x j,n —x i:n }^^s is a Brownian motion. By the Ito-Tanaka 
formula, 

(12) Yl' n = Y*' n + b(i/n)t + (c + a{i/n))f3\ + V t \ 

where V is a R n - valued continuous process with finite variation \V\ which writes V t l = J Q (7* — 
ji +1 )d\V\ s with d|y| t -a.e., 7^ = 7™" 1 " 1 = 0, 7* > and ~/i(Y?' n - Y^ hn ) = 0. We shall now refer to 
the process Y n as the reordered particle system and denote by fi n g V(C([0, +00), R)) its empirical 
distribution. 

Lemma 1.13 (Tanaka [38]). For a given random variable Yq € D n and an independent R 71 - 
valued Brownian motion (/3 1 , . . . , /3"), there exists a unique process (Y n , V) G C([0, +00), D n x R n ) 
satisfying all the above conditions. 

For Q g P(C([0,+oo),R)) and ti,...,t k > 0, let us denote by Qt u ...,t k € V(R k ) the finite- 
dimensional marginal distribution of Q. Let us define A as the set of probability distributions 
Q g P(C([0, +00), R)) such that, for all < t\ < ■ ■ ■ < t k , Qt u ...,t k is the distribution of (H * 
Qt 1 ) _1 (f7), . . . , (H * Qt fc ) _1 (C^)) where U is a uniform random variable on [0, 1]. Remark that any 
Q g A is exactly determined by the flow of its one-dimensional marginals t >-> Qt- 

Proposition 1.14. Under the assumptions of Proposition 1.1, the empirical distribution \x n of the 
reordered particle system converges in probability, in V(C([0, +00), R)), to the unique P G A such 
that for allt > 0, P t = P(t), where the mapping t <— > P(t) is given by Proposition 1.1. In particular, 
for all t > 0, H * P t — F t where F is the probabilistic solution to the Cauchy problem (1). 

Proof. Let 7r™ refer to the distribution of fl n in V(C([0, +00), R)). According to Sznitman [37], 
the tightness of (7r n ) n >i is equivalent to the tightness of the sequence of the distributions of the 
variables Y e ™' n £ C([0, +oo),R) where 6 n is a uniform random variable in the set {l,...,n}, 
independent of Y n . For all n > 1, Y has distribution m. Besides, for s < t and p > 1, 

1 n 

e - = - E E (\ y s' n ~ Y t ,n \ p ) = E (% p (m:,mD) ■ 

i=l 
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Now, by (9) and exchangeability of (Xl' n ) t >o, ■ ■ ■ , (X"'") t > , 



E 



1 n 

< -^E(|Xr-Xi'T) =E(\X}> n -Xi- n \ 



so that, by the proof of Lemma 1.3 and the Kolmogorov criterion, the sequence (w n ) n >i is tight. 

It is clear from the definition of the reordered particle system that for all t > 0, /i™ = /2™, 
therefore it is already known from Proposition 1.1 that /2™ converges in distribution to P(t). 
Consequently, the proof of Proposition 1.14 requires nothing but a uniqueness result for the support 
of any limit point of (7r™)„>i. 

The latter is a consequence of the following remark. Certainly p" <E A, and the set A is closed 
in V(C([0, +oo), R)) by [26, Lemma 3.5]. Hence, any limit point of (7r™)„>i is concentrated on the 
unique probability distribution P £ A such that for all t > 0, P t = P(t). □ 

Remark 1.15. In a recent work [35], Shkolnikov studies the original particle system (5) with the 
specific condition that the spacing process (Y^' n — Y t ' n , . . . , Y™' n — Y t n ~ 1,n ) t >o be stationary (the 
description of its stationary distribution is given in [22]). Then in the case where a is affine and 
b is C 1 , with b' uniformly negative, the flow of time marginals of the empirical distribution of the 
particle system is proven to converge in probability, in C([0, +oo), 'P(K)), to the unique mapping 
t i y P(t) such that (t, x) M> (H * P(t))(x) is a weak solution to the Cauchy problem (1) for a 
specified initial condition F . In place of our Proposition 1.2, the author uses Gilding's theorem 
for uniqueness [18, Theorem 4] and therefore needs to assume that any weak solution to the Cauchy 
problem is continuous on [0, +oo) x R when m does not weight points. As our proof shows, neither 
the continuity assumption on the solution nor the stationarity assumption on the initial condition 
are necessary, and the propagation of chaos result holds under more general conditions over the 
coefficients a and b as well as the initial condition. 

1.4. Propagation of chaos in Wasserstein distance. The original particle system defined 
by (5) is exchangeable, therefore the propagation of chaos result stated in Proposition 1.1 implies 
that the distribution P t ' n of X\' n converges weakly to P(t) in V(M). This convergence result can 
be strengthened in Wasserstein distance. 

Corollary 1.16. Under the nondegeneracy condition (Dl) and the assumption that m has a finite 
moment of order p > 1, then P t and P(t) have a finite moment of order p, and 

lim W p (P t hn ,P(t)) =0, lim E[W£(/£,P(t))]->0. 

Proof. Let t > 0. As just seen before the corollary, P t ' n converges weakly to P(t). To prove 
that this convergence holds in the Wasserstein distance of order p, it is sufficient to prove that the 
sequence (|A" t 1,Tl | p )„>i is uniformly integrable (see Villani [39, Theorem 6.9, p. 108]). For all q>l, 



(13) E(\Xl' n -X^)<2«- 1 ((*||&||oo)*+E 



{c n + o-{H*tf{X]^)))&Wl 



<C, 



where C does not depend on n. Thus, the sequence i\X\ ,n — Xq | p ) n >i is uniformly integrable, and 
since \Xl' n \P < 2 p - 1 (\Xl' n - X^\ p + \X£\p) then the sequence (|X t 1 '"|P)„>i is uniformly integrable. 
Therefore P t ' n and P(t) have a finite moment of order p and W p (P t P(t)) — > 0. 
Let M > 0. Then, by (8), 

pi/n n fi/n 

A M)du 



W*{tf,P{t)) = Y J / (\Yr-F t -\u)\P-M)+du + J2 / (\Yr-F t -\u)\r. 

l=l J(i-l)/n j=1 J(i-l)/n 

<J2 / ^ - F t -\u)\n {lYr _ F - 1{u)lP>M] du + J2 / (\Yr-F t ~ 1 (u)\PAM)di 

i=l J(i-l)/n i=1 J(i-l)/n 

On the one hand, 

n pi/n /*1 

E / (\Yr-F t -\u)\PAM)du= / (\(H*^)- 1 (u)-F t - 1 (u)\ p AM)du, 
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and the function /i <G 'P(R) i-> J * ^ l )^ 1 ( u ) ~ -Pt~ 1 ( u )l p A M)dii is continuous and bounded. 
Therefore, by Proposition 1.1, 



V / 



y^"-F t - 1 (u)| p AM)du =0. 

)/« / 



On the other hand, remarking that for all x, y G R, 

\X - y\ P l{\x-y\P>M} < N - y| P l{| K |>|y|VJ\/i/P/2} + \x - y\ P l{\y\>\ x \v M*/p /2} 

< 2 p \x\n UxlP > M/2P} + 2 p \y\n {lylP > M/2Ph 



(14) 
we write 



i=l J(t-l)/n 

n /*i 

- ~~~ X/ l^'"| Pl {|Y t , '"|p>M/2P} + 2P / \ F t 1 ( u )\ P1 {\F- 1 (u)\p>M/2p}^ U 
H i=l J ° 

^ Iffi | Pl {|A f i '"|p>A//2P} + 2? / l^t 1 ( - ")| Pl {|F t " 1 (ti)|P>M/2P}^ U - 
■ i ^/ 



(=1 

2 P 
n 



We deduce from the exchangeability of the variables X t , X"' n , the uniform integrability of 
(|X t 1 ' n |P) n > 1 and the finiteness of J 1 |F" 1 (u)|PdM = L |x| p P(i)(da;) that 



(" /.i/n 
E / \Yr-F t -\u)\n {[Y <, 
~iJ<,i-l)/n V * 



A/^+oo n >! \~{J(i-l)/n 11 * / 

so that E[Wg(t4, P^))} -> 0. □ 
2. Contraction of the Wasserstein distance between two solutions 

Let Fq and Go be the cumulative functions of two probability distributions with a finite first order 
moment. Under the condition (Dl), by Proposition 1.1 there exist a unique probabilistic solution 
F to the Cauchy problem (1) with initial condition Fq, and a unique probabilistic solution G to 
the Cauchy problem (1) with initial condition Go- This section addresses the behaviour of the flow 
t i y W p (Ft,Gt). In Proposition 2.1 we prove that it is nonincreasing if W p (Fq,Gq) < +oo, using 
only the contractivity of the reordered particle system. Then, assuming the classical regularity of 
F and G, we provide an explicit expression of the time derivative of the flow 1 1-> W£(F t , Gt). 

We point out the fact that we will sometimes call expectation or moment of a cumulative distri- 
bution function the expectation or the moment of the derivated probability distribution. 

2.1. Monotonicity of the flow. We first deduce from a natural coupling between two versions 
of the reordered particle system that the flow t n- W p (F t , Gt) is nonincreasing. 

Proposition 2.1. Assume that the nondegeneracy condition (Dl) holds and that Fq and Go have 
a finite first order moment. Then, for all p > 1, 

• if W p (F , Go) < +oo, then the flow 1 1— > W p (F t , G t ) in nonincreasing; 

• if W P (F , G ) = +oo, then for all t > 0, W p (F t , G t ) = +oo. 

Proof. We deduce the monotonicity property from the contractive behaviour of the reordered 
particle system. Let (J3 1 , . . . be a R^-valued Brownian motion and let U 1 , . . . , U n be inde- 
pendent uniform variables on [0, 1]. Let us denote by (U^\ . . . , f/W) the increasing reordering of 
(U\...,U n ). For aU 1 < i < n, let F F,i := F _1 (C/W) and F G,i := Gq^C/W). By Lemma 1.13, 
there exists a unique strong solution (Y F , V F ) G G([0, +oo), D n x R n ) to the reflected stochastic 
differential equation 

Y F,i,n = Y F,i + b{l/n)t + + a{l/nm + yf^ 

and similarly, we denote by (Y G ,V G ) the unique strong solution in G([0, +oo), D n x R") to the 
reflected stochastic differential equation 

Y G,i,n = Y G,i + h[i/n)t + + a{i/n)) pi + yG,\ 
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By the beginning of Subsection 1.3 and the Yamada-Watanabe theorem, the process Y F,n (resp. 
Y G ' n ) has the same distribution as the increasing reordering of the particle system X F,n (resp. 
X solution to (5) with initial conditions i.i.d. according to the cumulative distribution function 
Fq (resp. Go). In particular, the propagation of chaos result of Proposition 1.14 applies to the 
empirical distributions j2 F ' n := (1/n) X)"=i <V F .*.*» & n d jjP' n := (l/«) X)"=i ^Y a '*>»• 

Now, for all t > 0, (9) yields W*(fi[' n ,jj!?' n ) = (l/n)£" =1 l^f'*'" " Y^'T- The following 
inequality, the proof of which is postponed below, is crucial: 

(15) vo< s <t, wv{& n r£n<w${n F > n ,tf> n )- 



Case W p (Fo,Gq) < +oo. If both Fo and Go have a finite moment of order p, then owing to 
Corollary 1.16, one can extract a subsequence along which Wg(jif' n , jl t '") goes to W^(F t , Gt) and 
W$ (/if'™, A? '") g° es to WP(F S , G s ) almost surely, then conclude by using (15). 

Assuming only Wp(Fo,Go) < +oo, we shall now proceed as in the proof of Corollary 1.16 to 
show that for all i > 0, 

lim E(V^(/}f' n , /i G ' n )) = W?{F t , G t ), 

which results in the claimed assertion thanks to (15). 
For all M > 0, by (9) we write 



1 

tt/d / r"i,7i ~G.n\ x \ ^ i^yF.i.n \r 

(/x* ' , Mi ) = - V *t ~ *t 
F n * — ' 

i=l 

1 n 

i^(|y/A»_yGA*|P AM)+ Ay 



71 

i=l 



F t i,n_ Y G t i,n lP _ M ] + 



By Proposition 1.1, /if'™ converges in probability to the probability distribution dF t with cu- 
mulative distribution function F t , and similarly p, t '" converges in probability to dGj. Therefore, 
the couple (fl t ' n ,fl t ' ) converges in probability to (di^dGt) and 



hm Efl^(|y t ^'"-y t G ' I '"rAM) N ) = f 

By the monotone convergence theorem and (7), 



(\Ff\u)~G; 1 (u)\PAM)d 



u. 



hm hm E (-\2(\Y t F ' l ' n - Y t G '^ n \ p A M) ] = W£ (F t , G t ) e [0, +oo]. 

-> + oo n-y + oo \n f— ^ y F 



It now remains to check that 



(16) lim E [ - V(|lf "'" - y t G ^"|P - M) 



hm E (±£(|if A "-y t £ 

\ »=1 

Using (14) twice results in 

1 ™ + 1 ™ 



n '■ — ' \ / n 

i=l i=l 



|p>M} 



4P . 

- n / A x t I o I - L {|F t F ' , '"-y F -'|p>M/4!'} 



n . 

z— 1 



— \Y G ' i ' n - Y G ' i \ p 1 r- r- 

i=l 
OP ^ 

V* lv F >« _ v g '*ipi 

Z^i I r I - L {|Y ( f- I -y G - ! |p>M/2P}- 



i=l 
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On the one hand, by the construction of Y Q F ' 1 and Y Q ' l , 



-Y°'*\p>M/2>>} 



( 1 - 

\ i=l 

/ j n 

= E - E l^ 1 ^ ) - ^ 1 (^ (4) )| P l { |^-( ff W)-G-( ff « 

V i=l 

/l " 

= E \n E l^o" 1 ^) - G 1 ( [/ ')| Pl {|F- 1 (C/*)-G- 1 (C/i)|P>M/2P} 

V »=i 

IFq" 1 ^) - Gq 1 (u)\ p l {lF -i (u) _ G -i (u)lP > M/2P} du, 



)\p>M/2p} 



and the right-hand side does not depend on n. Since W p (Fq,Gq) < +oo, it goes to when 
M — > +oo. On the other hand, 

E (i g |5f A . _ ^ rir ,^ w J < | E (i g w - Y F r^ 

where we have used the inequality (9) in the last line. By (13), there exists C > independent of 
n such that EQxf'*' n - X^' i \P+ 1 ) < C. Then 

aA^J™ S ^ E E \ Y t F ' l,n ~ ^Tl { |^-_y -| P >M/4n) = °5 

and likewise, 



( 1 " 

lim lim sup E — I Y t 



G,i,n yG,i\ p -. 

I I - L {|Y t G ' l '"-Y G ' , |p>M/4P} 



which completes the proof of (16). 

Case W p (Fo,Gq) = +oo. By the triangle inequality, 

W P (F ,G ) < W p (F 0l F t ) + W p (G ,G t ) + W p (F t ,G t ). 

According to Proposition 1.14, there exists a subsequence (that we still index by n for convenience) 
along which fl ' n converges to the distribution with cumulative function Fq almost surely in V(M), 
and fif' n converges to the distribution with cumulative function F t almost surely in 'P(K). Recalling 
that the Wasserstein distance is lower semicontinuous on V(M) (see [39, Remark 6.12]), we get 
W p (Fq, F t ) < Uminfn-^+oo W p (p, ' n ,p, t so that by Fatou's lemma, (9) and (13), 

W?(F , F t ) < lim inf E (- £ 1^'" - ^'T ) < +oo, 

\ i=l / 

and similarly W p (Gq, G t ) < +oo. As a consequence, if W P (F , Go) = +oo then W p (F t ,G t ) = +oo. 
Proof of (15). Recall that 

yF,i,n = yF,i + b{l/n)t + + a{l/nm + V F t i t 
Y G,i,n = yG,i + b{l/n)t + + a{l/nm + V G,i } 
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with dV Fl = ( 7 f' 1 - Jt' i+1 )d\V F \ t , dV G > % = (7 t Gli - 7 f ' 1+1 )d\V G \ t . Thus, for all 1 < i < n, the 
process Y F ' l ' n — Y G ' l ' n is of finite variation, hence 

n n 

\Y t F ' i ' n - Y G ' l ' n \ p =pJ2 l*f - Y t G ' hn \ p - 2 (Y t F ' hn - Y G ^ n )d{Y Fl ' n - Y G ' hn ) 

i=l i=l 

n 

= pJ2 ^ - Y G ^r\Y F ^ - Y G ^)(^ - ^ +1 )d\V F \ t 

n 

+ P Y / \y t G ' hn - Y F ^r 2 (Y G ^ - if An )( 7 f - i? M1 )d\v G \ u 

i=l 

where, for all p > 1, we take the convention that |z| p ~ 2 z = when z = 0. The two terms of the 
last member above are symmetric; we only deal with the first one, which we rewrite «Std|l^|t. By 
the Abel transform, S t = pYh=2 It '* u 0f ' Y t ' % ~ l ' n , Y t F,hn , Y G ' l,n ), where u(xi, yi, x 2 , y 2 ) := 
\X2 7- y2\ p ~ 2 (x2 - Ift) - - 2/i| p " 2 (xi - j/i). Recall that d|^| t -a.e., 7 f' 1 > and 7 f'W *>" - 
if = 0, and remark that for fixed 1/1 < y 2 , the expression u(x, yi, x, y 2 ) remains nonpositive 
when xel. Then d|V F |t-a.e., St < and the proof of (15) is completed. □ 

2.2. Derivative of the flow. We will now compute the time derivative of the flow t H> WS{F t , Gt) 
when F and G have the classical regularity. We first derive a nonlinear evolution equation for the 
pseudo- inverse function Ff 1 . Of course, the same holds for G, . 

Lemma 2.2. Assume that the uniform ellipticity condition (D3) and the regularity condition (Rl) 
hold, that Fq has a finite first order moment, that the probabilistic solution F to the Cauchy 
problem (1) with initial conditon Fq has the classical regularity and that for all < t\ < t 2 , the 
function (t,x) 1— > d x F t (x) is bounded on [ti,t 2 ] x K . Then the pseudo-inverse function (t,u) t— > 
Ff (u) is C 1 ' 2 on (0, +00) x (0, 1) and satisfies 

(17) d t F t -\u) = b(u) - \d u ( " ( j 

2 \d u F t (u) 

Proof. On the one hand, since F is a classical solution to (1), then 

(18) d t F t (x) = i (a'(F t (x))(d x F t (x)) 2 + a(F t (x))d 2 x F t (x)) - b(F t (x))d x F t (x). 

On the other hand, the lower bound in the Aronson estimate (47) allows to apply the implicit 
functions theorem to (t, x, u) n- F t (x) — u and deduce that (t, u) M- Ff (u) is C 1 ' 2 on (0, +00) x R. 
Besides, 

(19) d u Fr 1 (u) = L, and &Fr l (u) = - ^I^llMl . 

^ ' * K ' d x F t {F^{u)) u 1 K ' [d x F t (Ff (u))] 3 

For all t > 0, since F t is a continuous function then F t (Ff 1 (u)) = u, so that derivating with 
respect to t yields = dtF^Ff 1 («)) + d x F t (F^ x (u^dtFf 1 (u). Using (18) and (19), we write 

, 7 / \ 1 / o'(u) a{u)dlF t -\u) 



d t F t -\u) = b(u) 



2 u^rV) [au^rH")] 2 

from which we deduce (17). □ 

For all function / : K -> [0, 1], we define the function tl(/, ■) : R -> [0, 1] by 

tl(/, 1) := 1 {;c >o } (1 - f{x)) + l {x < 0} f(x). 

Then the main result of this subsection is the following. 

Proposition 2.3. Assume that the conditions of Lemma 2.2 are satisfied with both F and G. Let 
p>2 such that W P (F ,G ) < +00 and |x| p_1 (tl(F , x) +tl(G ,x)) -> when x -> ±00. Then for 
all < ti < t 2 , 

W p (F t21 G t2 )-W p (F tl ,G tl ) 

= -^> r f 1 a(u)\F t -Hu) - cri(u)M ; *%<f**L 

2 J tl Jo d u F^{u)d u Gi\u) 
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Proof. See Appendix B. 



□ 



Remark 2.4. A straightforward strategy to prove Proposition 2.3 is the following: for ei,C2 G 
(0, 1/2), let 



\F^{u)-G^{urdu, 



then certainly W* tlltta (F u G t ) -> WP(F t ,G t ) when ei,e 2 -> 0. Now, 
d 



dt 



W? tl , t2 (F t ,G t ) 







= p 


L 1 






_ p 


/.]— e 2 




2 





Gr 1 ( u ))iF t - 1 M-G t - 1 ( u )r 2 a u 



a(u) 



a(u) 



i i 



du 



9„G t - x ( U ) <9 U -F t _1 (u) 



p{p- 1) 



a^M-'M-G^HI 



p-2 



■du, 



a u F t - 1 (u)5„G t - 1 (u) 

where we have used (17) at the second line and integrated by parts in the last line. Hence, 
Proposition 2.3 holds as soon as the boundary terms vanish, i.e. 



lim inf 

£l,£2->0 



a{F^-G^)\F^-G^r 



d u Gl l 



r) P -1 



dt = 0. 



However, we were not able to provide a rigorous account of this statement. In Appendix B, we use 
a different expression of W£(Ft, Gt) in terms of F t , Gt to compute the time derivative of the flow. 

Remark 2.5. Although the setting is different, the result of Proposition 2.1 obtained by a proba- 
bilistic approximation is comparable to the result of Carrillo, Di Francesco and Lattanzio [9, The- 
orem 5.1], the proof of which relies on the deterministic operator splitting method. In Lemma 2.2, 
the nonlinear evolution equation for the pseudo-inverse of the solution to the Cauchy problem 
generalizes in a rigorous way the proposed extensions of the work by Carrillo and Toscani [10, Sec- 
tion 3]. In Carrillo, Gualdani and Toscani [12], the time derivative of the flow of the Wasserstein 
distance between two solutions is computed by the method described in Remark 2.4 for the case 
of compactly supported solutions at all times, therefore the boundary terms necessarily vanish in 
the integration by parts. 

The same method is also applied by Alfonsi, Jourdain and Kohatsu-Higa [1], where the authors 
get rid of the boundary terms using Gaussian estimates on the density. As Lemma B.2 shows, such 
estimates still hold in our case as soon as the tails of Fq or Go are not heavier than Gaussian. Since 
we are willing to use Proposition 2.3 to compare Ft with the stationary solution Fqo, we would 
therefore need the tails of F^ not to be heavier than Gaussian. But according to Remark 3.5 
below, at least under the uniform ellipticity condition (D3), the tails of Foo cannot be lighter than 
exponential. 



3. Convergence to equilibrium 

This section is divided into three parts. In Subsection 3.1, we solve the stationary equation. In 
Subsection 3.2, we prove the convergence of solutions to stationary solutions. In Subsection 3.3, 
we discuss the (lack of) rate of convergence to equilibrium. 

3.1. The stationary equation. We recall that the stationary equation is the following: 

(20) ^(A(Foo(x))) - d^BiF^x))) = 

As mentionned in the introduction, the stationary solutions for the Cauchy problem are the cu- 
mulative distribution functions, with a finite first order moment, solving (20) in the sense of 
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distributions. In Proposition 3.1, we solve the stationary equation, and we give a criterion for 
integrability in Corollary 3.4. 

Proposition 3.1. Under the nondegeneracy condition (Dl), a necessary and sufficient condition 
for the existence of cumulative distribution functions solving the stationary equation is B(l) = 0, 
B(u) > and the local integrability of the function a/2B on (0,1). Then all the solutions are 
continuous. 

If in addition B{u) > for all u £ (0, 1), which corresponds to the equilibrium condition (El), 
then Foo is a solution if and only if there exists iel such that for all -Foo(^) = ^ (x+x), 

where the function ^ is defined by 

(21) VuG (0,1), *(«) := f $L dv. 



1/2 



2B(v) 



In this case, x = ^(F^O)). 



Proof of Proposition 3.1. We first prove the necessary condition. Let F be a cumulative distribu- 
tion function (we shall write F instead of F^ in the proof), solving (20) in the sense of distributions. 
Then there exists eel such that the function x M- (1/2) A(F(x)) is absolutely continuous with 
respect to the Lebesgue measure, with density B(F(x)) + c. Since by the condition (Dl), A is 
increasing, then F is continuous. Hence, B(F(x)) + c is a nonnegative, continuous and integrable 
function, so that taking the limit x — > — oo yields B(0) + c = c = 0. We deduce B(u) > and 
B(l) = by taking the limit x — > +oo. 

It remains to prove that a(u)/2B(u) is locally integrable in (0, 1). We use the convention that 
a(u)/2B{u) = when a(u) = B(u) = 0, and we define U := {u £ (0, 1) : B(u) = and a(u) ^ 0}. 
Let < a < (3 < 1 and X- := F^ 1 (a), x + := i ?_1 (/3). We denote by dF(x) the Stieltjes measure 
associated with the continuous function F of finite variation. Then by the chain rule formula [34, 
(4.6) p. 6], the Radon measure (l/2)a(F(x))dF(x) has the density B(F(x)) with respect to the 
Lebesgue measure. Therefore dF(x)-a,.e., F(x) E" U. By the change of variable formula [34, (4.9) 
p. 8], 

■ a(F(x)) . , /■*+ a(F(x)) , , N f a(u) , 

2B{F(x)) ™? u > v ; J x 2B(F{x)) V ' J a 2B{u) 

and the left-hand side is bounded by x + — x_ < +oo. 

We now prove that the condition is sufficient. For u S (0,1), let us define ^f(u) as in (21). 
Then ^ is absolutely continuous and since A is increasing, so is VP. Thus "J -1 is continuous, with 
finite variation. In case has a finite limit in 1 (resp. 0), we write 1 ^~ 1 (x) = 1 (resp. 0) for 
x E [\&(l),+oo) (resp. (—00,^(0)]) so that "J -1 is a cumulative distribution function on R. We 
now check that is a solution, in the sense of distributions, of the stationary equation. Let 
4> E C£°(R) be a test function. Then by the integration by parts formula [34, (4.5) p. 6] and the 
chain rule formula applied to A(^/~ 1 (x)), 

- l -Jj{x)A{^-\x))dx = i^</>( 2 ;)a(*- 1 (x))dvl/- 1 (x). 

By the change of variable formula and the definition of ^ , 

- / 0(a:)a(*" 1 (a;))d*~ 1 (a;) = - [ </>(*(u))o(u)d« = f (j)(^(u))B(u)^' {u)du, 
2 Jr 2 Jq Jo 

and performing a new change of variables in the last member above yields 

1 



<j)'{x)A(^- 1 (x))dx = J </)(a;)B(*" 1 (a;))dx, 

i.e. is a solution in the sense of distributions of the stationary equation. 

We finally assume that B(u) > for all u E (0, 1) and prove that F is a solution if and only if it 
is a translation of , F~ 1 . If F is a solution, since B(u) > then "J is C 1 on (0, 1) so that the chain 
rule formula gives 

d^(F(x))) = y'(F(x))dF(x) = ^0^ dF(x) = dx, 
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where the last equality holds due to B(u) > 0. Then ^(F(x))-^f(F(0)) = x and F(x) = , $>~ 1 (x+x) 
with x — ^(F(0)). Reciprocally it is immediate that all the translations of solve the stationary 
equation. □ 

Remark 3.2. When the condition that B(u) > on (0, 1) is not fulfilled, then one can exhibit 
solutions that are not translations of each other. For instance, let a(u) = u(l — u)\u — 1 /2 1 3 / 2 and 
B(u) = u(l — u)(u — 1/2) 2 . Then a is continuous on [0, 1], its integral satisfies (Dl), B is C 1 on 
[0, 1], B(0) = B(l) = and B(u) > 0. Besides, = sgn(u- l/2)|u- 1 /2| 1 / 2 . For all h>0, let 
us define 

x<-l/-\/2, 



Foo,h( X ) ;= ' 



1/2 -x 2 -1/V2<x <0, 

1/2 0<x<h, 
l/2+(x-h) 2 h < x < h+ 1/V2, 
1 x>h + l/\/2. 

Then -Fqo o = ^ r_1 an d for all h > 0, i 7 ^ h solves the stationary equation although it is not a 



translation of ^ 



-l 



In order to apply the results of Section 2, we need criteria ensuring the existence of a first order 
moment as well as the classical regularity for a stationary solution Foo . They come as a corollaries 
to Proposition 3.1. 

Corollary 3.3. Under the nondegeneracy condition (D2), the regularity condition (Rl) and the 
equilibrium condition (El), all the stationary solutions Foo are C 2 on K. 

Proof. By Proposition 3.1 and the condition (El), it is enough to prove that is C 2 on R, 
which follows from the inverse function theorem, since ^ is C 2 on (0, 1) by (Rl) and (El), and 
*'(u) > for all u e (0, 1) by (D2). □ 

Corollary 3.4. Under the nondegeneracy condition (Dl) and the equilibrium condition (El), the 
solutions to the stationary equation have a finite first order moment if and only if (E2) holds. 

Proof. According to Proposition 3.1, it is enough to prove the statement for the solution the 
first order moment of which is given by 

1 r 1/2 r 1/2 a(u)u, f 1 o(u)(l - u) 



tf(u)du- / $(u)du= / -^-du+ / " ' Au 

1/2 Jo Jo 2B { U ) Jl/2 2B{u) 

by the Fubini-Tonelli theorem. The finiteness of the right-hand side is the condition (E2). □ 

Remark 3.5. In the case of the viscous conservation law and under the condition (El), it is 
sufficient that 6(0) > and 6(1) < for (E2) to hold, and in this case the probability distributions 
derivated from the stationary solutions have exponential tails and satisfy a Poincare inequality 
(see [28, Lemma 2.1]). In the general case of the stationary equation (20), and under the equilibrium 
condition (El), these results extend as follows. 

• Under the uniform ellipticity condition (D3), if 6(0) > and 6(1) < then it is clear 
from the expression of ^ that the stationary solutions still have exponential tails, and they 
consequently satisfy a Poincare inequality. If 6(0) = (resp. 6(1) = 0), then the left (resp. 
the right) tail is heavier than exponential. 

• Under the nondegeneracy condition (Dl), as soon as the cumulative distribution function 
^E' -1 admits a positive density p, then it satisfies a Poincare inequality if and only if it 
satisfies the Hardy criterion (see [2, Theorem 6.2.2, p. 99]), namely 

[ +c ° f x f x f° dy 

sup / p{y)dy / — - < +oo, sup / p{y)dy / —— < +oo. 

Jo P{V) x<oJ-oc Jx p{y) 



x>0 

Letting y = ^(v), we rewrite 

dy 



p{y) 



(*») 2 dv, 

1/2 
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so that that the stationary solutions satisfy a Poincare inequality if and only if 

. f u ( a(v) \ 2 , f 1/2 ( a(v) \ 2 , 

sup (1-u) — j-- dv < +00, sup u / — — dv < +00. 

u>l/2 Jl/2\ 2B { V )J u<l/2 Ju \ 2B ( V )J 

3.2. Convergence in Wasserstein distance. We now state the main result of the article, namely 
the convergence to equilibrium of the probabilistic solutions in Wasserstein distance. 

Theorem 3.6. Let us assume that: 

• the coefficients of the Cauchy problem (1) satisfy the uniform ellipticity condition (D3), 
the regularity condition (R2) and the equlibrium conditions (El) and (E2); 

• the probability distribution m has a finite first order moment; 

• W 2 (H *m, < +00. 

Let F be the probabilistic solution the Cauchy problem (1) with initial condition H *m. Then there 
exists a unique stationary solution Foo such that Fq and Foo have the same expectation, and for 
all p>2 such that W P (H * to, \I> _1 ) < +00, 

Vl<q< P , lira W q (F t ,F QO ) = 0. 

Proof. The proof is in 6 steps. 

Step 1. We first prove the existence and uniqueness of a stationary solution Foo such that 
Fq and F^ have the same expectation. By the condition (E2), the integral J Q ^>(u)du is defined. 
Owing to the condition (El) and according to Proposition 3.1, the stationary solutions are the 
functions of the form Foo(x) = ^~ x (x + x), x <G R. By Corollary 3.4, the expectation of such a 
cumulative distribution function exists and it is given by 

1 ,1 
F oc 1 (u)du = / ty(u)du — x. 
Jo 

Thus, the unique stationary solution with the same expectation as Fq is given by Foo(x) = 5' _1 (a; + 
x), with x = Jq(^(u) - F~ 1 (u))du. 

Step 2. We now introduce a smooth approximation of the initial condition F in order to 
use Lemma 1.6. Let ( be a C°° probability density on R with compact support and such that 
L x((x)dx = 0. For all a > and x G R, we define Q a (x) := a~ 1 ((a~ 1 x) and Fq(x) := F * ( a (x). 
Then Fq is C°° on R and it is the cumulative distribution function of Xq + aZ, where Xq has 
distribution to, Z has density £ and Xq and Z are independent. As a consequence, Fq has a finite 
first order moment and it has the same expectation as Fq and F^ due to Step 1. By Lemma 1.6, 
the probabilistic solution F a to the Cauchy problem (1) with initial condition Fq has the classical 
regularity and for all finite T > 0, it belongs to C 1 ' 2 ([0,T] x R). 

Using the obvious coupling (Xq, Xq+cyZ) of Fq and Fq, we note that for all q > 1, W g (Fo, Fq) < 
aEdZl 9 ) 1 / 9 < +00. This leads to the following remarks: 

• As is a translation of by the triangle inequality, W q (H*m, and W q (FQ , Foo) 
are simultaneously finite or infinite. 

• Using the triangle inequality again and Proposition 2.1, we get 

W q (F t ,Foo) < W q (F t ,F f a ) + W q (F t a ,Foo) < aE(\Z\ q ) 1 ^ q + W q (Ff* , Foo)- 

Hence, as soon as, for all a > 0, limsup t _,. +00 W q (F^,F ao ) = 0, then taking a arbitrarily 
small yields lim t ^ +00 W q (F t , Foo) = 0. 
We now fix a > 0. The remaining steps are dedicated to the proof of the fact that, for all p > 2 
such that W P (H * to, l^ 1 ) < +00, for all 1 < q < p, lim t ^ +oc W q (F" , Foo) = 0. 

Step 3. We prove that for all t > 0, the expectation of F" remains constant. By Corollary 1.12, 
for all t > 0, F" is the marginal cumulative distribution function of the nonlinear diffusion process 
X a solution to (4) with initial condition having cumulative distribution function Fq . Since a and 
b are bounded, 

Vt > 0, E(A7) = E(A Q ) + / E[b(K(xm<l S . 
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But for all s > 0, F S Q is continuous so that E[6(F S Q (X S Q ))] = /J b(u)du = B(l). By (El), we 
conclude that E(X?) = EpQf). 

Step 4. We now describe the evolution of the Wasserstein distance W^F", Fx,). We are 
willing to use Proposition 2.3, therefore we need to check that F a and F^ satisfy the assumptions 
of Lemma 2.2. It is the case for F a thanks to Lemma 1.6. The stationary solution F^ has a finite 
first order moment owing to the condition (E2) and Corollary 3.4, it is C 2 on R by the condition 
(R2) and Corollary 3.3, and from the definition of "f^ 1 and condition (D3) it follows that the 
derivative of Fx, is bounded by 2||i?|| 00 /tj. 

Moreover, by the assumption that W%{H *m, "f^ 1 ) < +oo and Step 2, W2(F ol ) F oo ) < +oo; and 
since both Fq and Fx, have a finite first order moment, |a;|(tl(F(f , x) + t\(Foo,x)) vanishes when 
x — > ±oo. Thus, Proposition 2.3 applies to F a and Fx, with p = 2 and yields, for all < ti < i2, 

Wl(F«,F 00 )-W?(FZ,F 00 ) = - f f a(u) ^ffj'^ '^^ dudt < 0. 

Jt x Jo d u {F?) ^ujduFoo (u) 

Using the uniform ellipticity condition (D3), we can then assert that 

hin[/ - fr w)-i-)-ww) , d , BOi 

and extract a sequence (£ n )n>i growing to +oo such that the integral above goes to along (t n ) n >i- 
Let us prove that for all u € (0, 1), 

(22) lim / \d u (F t a n )- 1 (v)-d u F- 1 (v)\dv = 0. 

Let < e < 1/2. By the Cauchy-Schwarz inequality and the condition (El), 
(23) 

( d u (F«)~i - d u F ~i) 
d u (F t a J-^d u F 

,2 \ 1/2 



J ~ e \d u (F t l)-l-d u F-l\dv<l£ " d u (F t Z)-'d u F-'dv J " { ijpay.g^ ' dv 



2 \ 1/2 



^ e[ e,i-e] 2B(«) ^ l »' ^ 0„(F£)-i8 u f£ 

The first integral can be bounded uniformly in n as follows: 

d u (F t a J-\v)dv = (fZ)- 1 ^ - e) - (F t Z)-\e) 
< - r / \(F£)- 1 (l-e + v)-(Fg t )- 1 (e-v))dv 



e 

9 / W 2 



< - 

e 



(F-\l-e + v)-F- 1 (e-v))dv+ / |(F t ") _1 (i;) - F~ x (t;)|d' 



< - ( f' 2 (F~\l - e + v)-F- 1 (e- «)) d« + T Kitf)" 1 ^) - F-»|d« 



r 



where the last inequality is due to Proposition 2.1. We deduce that the right-hand side of (23) 
goes to 0, so that taking e < u A (1 — u) yields (22). 

Step 5. We extract a subsequence of (t n ) n >i, that we still index by n for convenience, such 
that lim„^ +00 (F^)- 1 (l/2) - F~ 1 (l/2) = I G [-oo, +oo]. Then using Step 4, for all u e (0, 1) one 
has (F t ") _1 (it) — i 7 ' 00 1 (u) — > i. Besides, since by Proposition 2.1, 

sup f |(J^)- X («) - ^(«)| 2 d« = W-2 (Fq , ) < +oo, 
t>0 Jo 
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then the functions (u H> (F") (u) — i 7 ^ 1 («))„>! are uniformly integrable. We deduce using 
Step 3 that 

1= Urn f ((F t l)- 1 (u)-F- 1 (u))du = 0. 

Step 6. Let p > 2 such that W p (Fff, < +oo. Then by Step 2, W p (Fg, F^) < +oo; 

therefore, for all 1 < q < p, the functions (u n- \{Fj* )~ 1 (u)— F^ 1 (u)\ q ) n >i are uniformly integrable, 
and using Step 5 we have 

lim [ 1 \(F t a J- 1 (u)-F- 1 (u)\«du = 0. 

But according to Proposition 2.1, the flow t W q (F^,F 00 ) is nonincreasing. As a consequence 
limi_,. +00 W q (F" , Fqo) = and the proof is completed by virtue of Step 2. □ 

3.3. Rate of convergence. We first recall the result of convergence to equilibrium stated in [28], 
where A(u) = a 2 u with a 2 > 0. Then it is easily checked that the conditions (Rl), (El) and (E2) 
are satisfied if B is C 2 on [0, 1], with 5(1) = 0, b(0) > 0, 6(1) < and B{u) > on (0, 1). Then 
according to Remark 3.5, all the stationary solutions F^ admit a positive density p x and satisfy 
a Poincare inequality. Under these assumptions, we have the following convergence result. 

Lemma 3.7. [28, Lemma 2.8] There exist r\ > and c > depending on A and B such that for 
all cumulative distribution function Fq with a finite first order moment, calling F^ the sationary 
solution with the same expectation as Fq, as soon as J (Fq — i 7 'oo) 2 /Pooda; < rj then 

f (F t ( X ) -FM)\ X < ggH*) f (F ( X ) - F ( X )f dx 

JR Poo{X) c J r Poo{x) 

According to [27, Proposition 1.4], the quadratic Wasserstein distance between two probability 
distributions fi and conl such that fi admits a positive density p satisfies the inequality 

w^)<±f {H *^ x) - H * v{x)? d*. 

Jm P(x) 

Hence, the convergence result of Lemma 3.7 can be translated in terms of the Wasserstein distance. 

Corollary 3.8. Under the assumptions of Lemma 3.7, as soon as J(Fq — F^) 2 /p^dx is small 
enough, then WaC-F*) -Poo) converges to exponentially fast. 



Appendix A. Proof of Proposition 1.2 

This appendix is dedicated to the proof of the first point of Proposition 1.2, which states that 
there is at most one weak solution to the Cauchy problem (1) in the set J-(T), for all T > 0, 
possibly T = +oo. The proof is adaptated from Wu, Zhao, Yin and Lin [41, Section 3.2] as well 
as Liu and Wang [31], who provide uniqueness of bounded weak solutions to the initial-boundary 
value problem, namely the Cauchy problem (1) in the strip [0, T) x (0, 1) with boundary conditions 
at x = and x = 1. At an intuitive level, one can see our restriction to the set .F(T) as some 
boundary conditions at x = — oo and x = +oo. 

We shall follow the so-called Holmgren's approach which consists in turning the proof of unique- 
ness for (1) into a proof of existence for an adjoint problem. Recall that we make the following 
nondegeneracy assumption: 

(Dl) The function A is increasing. 

Let T > 0, possibly T = +oo, and let F\F 2 E F{T) such that for all g e C c °°([0, T) x R), both 
F 1 and F 2 satisfy (2). Then, for all t G [0,T), the function if - F/ is integrable on K and the 
function (s,x) F 2 (x) — F}(x) is integrable on Q t := (0,i) x K. Therefore, for all t e [0,T) and 
for all g G C* c °°([0, T) X R), (2) yields 

(24) J (F 2 - F, 1 ) ^Ad 2 g + Bd x g + 9 s3 | dsdx = J^F 2 {x) - F t 1 (x))g(t,x)dx; 
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where 



and 



A(s,x) = / a((l-9)F}(x)+0F*(x))d9, 



B(s,x) = / b((l-0)F}(x) + OF?(x))dO. 
Jo 

Remark A.l. For all t £ [0,T), by a classical regularization argument the integral equality (24) 
holds true for all function g in the space C h ' ([0,t] x R) of real-valued C 1,2 functions bounded 
together with their derivatives. 

Let / £ C c °°([0,T) x R). Then there exists t £ [0,T) such that Supp/ C [0,*) x R. Let us 
introduce the adjoint problem to (1) as 

1 7, 



(25) 



^Ad 2 x g + B8 x g + d s g = f 



(s,x) £ [0,t) x R, 

X £ R. 



g(t, x) = 

The coefficients A and B may not be smooth enough to allow the adjoint problem to admit 
classical solutions. Therefore we introduce a suitable approximation of (25). For small 8, r\ > 0, let 

G S := {(s,x) £[0,t] x R : \F}(x) - F 2 (.t)| < 6}, 
F s := {(s,x) £ [0, f] x R : \F}(x) - F 2 (x)| > 6}, 



(26) 

and let us define 







on Gs, 



X 5 Js,x) 



-(r) + A(s,x)) 



-1/2 



B(s,x) on Fs- 



Since A is increasing and F 1 ,^ 2 are bounded, there exist L{5) > and if (8) > independent of 
rj such that 

A(s,x) > L{8) (s,x) £ F s , 

\>$(8,x)\<K(S) (s, x) £ [0, i] x R. 

Let £ be a C°° probability density on R 2 such that Supp£ C [-1, 1] x [— 1, 1]. For all e > 0, 
let £ £ := e- 2 ^(e- 1 s 1 e- 1 x) and define A e = A * £ e and Af ; £ = A* * Then A e and A* e are C°° 
functions and all their derivatives are bounded on [0,t] x R. Besides, 

a.e. in [0, t] x R, 

a.e. in [0, t] x R, 

(s,x) £ [0,t] x R, 
(s,x) £ [0,t] x R, 



(27) 



lim ^4 e (s, = A(s, x) 

lmAf ;e (s,a;) = X S n (s,x) 

A e (s,x) < C 
\X s Vjt (s,x)\<K(5) 



where C refers to a positive constant independent of e, 8 and 77, and K{8) refers to a positive 
constant depending only on 8. In the sequel, the values of C and K{8) can change from one line 
to another. 

We finally define 



{1-, + A t {s,x)) 



B s V!e (s,x)=X s nte (s,x) \ 

and emphasize the fact that 

(28) HB^IU 

We are now able to introduce the approximate adjoint problem 



1/2 



f 



(29) 



-(r) + A e )dig + B° te d x g + d s g = f 
g(t,x) = 



(s,x) £ [0,t) x R, 
x £ R. 
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The coefficients of the equation are bounded, globally Lipschitz continuous, the operator is uni- 
formly parabolic, and the right-hand side / is continuous and bounded. Therefore the Cauchy 
problem (29) admits a unique classical bounded solution g^ e (see [29, p. 369]). Since the coeffi- 
cients of the equation and / are C°° on [0,t] x K, then so is g^ e (see [17, p. 263]). Owing to the 
Feynman-Kac formula, g & e has the following probabilistic representation: 



(30) 



V(« I x)6[0,t)xR ) 9 S r,,e(s,x) 



-E 



f(r,Z s r ' x )dr 



where, for a given standard Brownian motion W, (Zp x ) r ^\o,t] is the unique strong solution of the 
stochastic differential equation 



x+ B^(u,Z^)du+ / (r ] + A e (u,Z^)) 1 / 2 dW u . 



(31) 



Lemma A. 2. The functions g^ e) d x g%. e and d£gt e are such that: 



(32) 

(33) 
(34) 
(35) 



sup |ff*, e (s,x)| < C, 

[0,t]xK 



sup 

«e[o,t] Js. 



g d Vte (s,x)\dx<K(S), 
sup \d x g s n t {s,x)\ < K(e,d,n)cxp(-x 2 /K(e,8,T])), 

2 



se[o,t] 

sup \d*g^ £ (s,x)\ < K(e,S,r])exp(-x 2 /K{e,S,T])) 



se[o,t] 



where the value of K(e,S,T]) can change from one line to another. 

Proof. The inequality (32) directly follows from the Feynman-Kac formula (30). Besides, since / 
has a compact support in [0,t] x R, say Supp/ C [0,t] x [a;_,x+], one has 



|<4MI<II/ 



and for x > sc+, 



J (^' x € [a:_,a + ]) < W(x - > a: - x+) < 



P(Z;' X G [x_,x+])dr, 

E{{x - Zp x ) 2 ] 



(x — x+y 



Owing to (31) and (28), 



n(x ZP*Y\ <2[{r- sYWB^ + (r - s)\\r, + A e |U j < K{6), 

and similar arguments for x < X- yield (33). 

In order to prove (34) and (35), let us take the derivative with respect to x of the problem (29). 
Then the function d x gf, e is the unique classical solution of the Cauchy problem 

\{r, + AJdlg 1 + (\d x A e + B*\ d x g x + d^g 1 + d^ 1 = d x f, 



(36) 

g 1 {t,x)=0, 
and the Feynman-Kac formula now writes 



-E 



(37) 



d x f[r, Zl^*) cxp (J d x B 6 v>e {u, Z^ x )du ) dr 
9 x f{r, z)G 1 (s, x\ r, z)dzdr 



where (Z^ ,s,x ) re ^ t i is the associated diffusion process and G 1 (s, x; r, z) is the fundamental solution 
of (36). Following Friedman [17, p. 24], there exists some constant n > depending on the 
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coefficients of (36) (therefore, on e, 6 and rf) such that, for all s < r, 

(38) IG 1 ^, W )| < " exp (-^=^r) , 

(r — s) 1 /^ \ n(r — s) J 

(39) |S KG ri( S) x;r^)|<-^expf-^^y 

r — s \ K[r — sj J 

For x > x+, (37) combined with (38) yields 

|3,4>.*)l s /'£. IIM- <7^ exp Hf=#) d2dr 

< «ll<WII-(«* - ..)«p (-fc=^±f ) jT F ^ yl7J 

< 2(1 - «) i ' 3 «ii3,/iu,(i + - i_)op f- ^r^n . 

and similar arguments for x < X- lead to (34). Likewise, for x > x+, using (39) one gets 
|SXeMI< /' r H^/Hoc— exp f- ( " + H dzdr 

<«||Sic/||oo(» + -x_) / — exp(- ( X ~ X+) * ) dr. 

J s r - s \ K(r-s) J 

Writing, thanks to the change of variable v = (x — x+)/(n(r — s)) 1 / 2 



exp — dr = / — exp(— v )dv, 

r-s \ K(r-s) J J(x-x + )/( K {t-s)y/ 2 v 

and using the fact that, as soon as x > x + + {tn) 1 / 2 , 

x — x + 1 n{t — s) 

- (K(t - S))V2 ' v ~ ix-X^' 

we deduce that for x > x+ + {tn) 1 / 2 , 

+°° " K{t-s) ( {x-x+) 2 \ ( {x-x+f 



/ + oo 2 
— exp(— v 2 )dv < 
.X-X J- )/(K(t-s)) 1 / 2 V 



■ exp — < exp . 

l(x-x + )/( K (t-s))V2 v [x-x+y V K{t-a) ) \ K(t-S) 

By similar arguments for x < x~, one finally concludes to (35). □ 
By the definition of g^ e , 

(F 2 (x) - F}(x))f(s,x)dsdx = f (F 2 {x) - F x s {x))f{s,x)dadx 



(40) 



[0,+oo)xR 



/ ~ F*) {\(v + I £ )^<4 + B^dxg 5 ^ + 9 s9 S n , e } dsdx-. 



It follows from the boundedness of A t and B^ e and from Lemma A. 2 that 

sup \d 3 g* >e (s,x)\ < K,(e 1 5,r l )exp(-K(e,8,r])x 2 ). 
se[o,t] 

Consequently, g^ f G C^' 2 ([0,t] x M), therefore due to remark A.l, 

(41) J {F 2 - Ft) | \Ad 2 J lht + Bd x g s ^ + d s g^ dsdx = 0. 

As a conclusion, subtracting (41) to (40), 

(F 2 {x)-Ft{x))f( Sl x)d S dx 



(42) 



[0,+oo)xR 



J (Fs F^ jifo + l e - i)^<e + (B* e - B)ft,^, e } dsdx. 
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We now have to prove that the right-hand side of (42) goes to as e, 5, rj — > 0. In this purpose, we 
closely follow the line of [31]. In particular, the proofs of our Lemmas A. 3 and A. 4 are nothing but 
transcriptions of the proofs of Lemmas 1 and 3 in [31] to the framework of an unbounded domain 
Qt and weak solutions in F(T). Then the estimates (32)-(35) ensure that the computations still 
make sense. 

Recall that C refers to a positive constant that does not depend on e, rj or S. 
Lemma A. 3. [3] , Lemma 1] The functions dxg^ e and d 2 g^ e are such that: 

(43) / h v + A e )(d 2 x g s Vje ) 2 dsdx<^p- + C, 

JQt 1 V 

(44) / (d x g 5 ve ) 2 dsdx<^ + C. 

JQt V 

Lemma A. 4. [31, Lemma 3] The function d x g^ e is such that: 



(45) sup / \d x g^(s,x)\dx<C. 

se[o,t] Js. 

The estimates of Lemmas A. 2, A. 3 and A. 4 give sufficient uniformity over the derivatives of g^ e 
to conclude. 

Proposition A. 5. The right-hand side of (42) is arbitrarily small when e,r},8 — > 0. 
Proof. For lighter notations, let us denote F(s,x) = F 2 (x) — F^(x), and 

I := I F\{A t - A)d 2 J rie dsdx+ ( F T j-d 2 x g 5 dsdx+ [ F(B 6 - B)d x g s dsdx 
JQt 1 JQt 1 JQt 

= : h+h+h. 

Recall that since F 1 ,F 2 g T(T), then F e (L 1 n L°°)(Q t ). 
Owing to the Cauchy-Schwarz inequality, 

/ ~ ~ \ 1 1 /2 

(46) |7r|< (J \F\ { ^~f dsdx) Qf \F\ l -{ v + A t )(d 2 x g^) 2 d S dx^ . 

Using (27), by dominated convergence the first integral in the right-hand side of (46) goes to 
as e — > for fixed ?y and 8. According to Lemma A. 3, the second integral in (46) is bounded by 
C + K(S)/n. Therefore, for fixed r\ and S, lim^o^! = 0. 

Let a > 0. Recalling the definition (26) of F a and G a , let us write 

\^\<lJ F \Fd 2 x g 5 n Jdsdx+^J G \Fdlgtjdsdx. 

By the Cauchy-Schwarz inequality, 



L ms - Maix - (L |p| o7iffx)) 1/J (L n\<*+Mw-*y 



L(a) V V 
where L(a) only depends on a. Likewise, 



'(1/2)( V + A 

*&(«(<> - m 



1/2 
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so that for fixed 77, a, 



limsup|/ 2 | < Cr) ( C 



{ L(a) 



Finally, let us write 

|/ 3 |</ \F(B s Vte -B)d x g s v Jdsdx+ f \F(Bf he ~ B)d x g s v Jdsdx. 

We first deal with the integral on G$. On account of Lemma A. 3, for given 5 and r\ the family 
(\^x9n d)e>o is bounded in L 2 (Q t ). Therefore there exists a sequence (tk)k>i decreasing to 0, 
such that e J converges weakly in L 2 (Q t ) to a function /i > when fc — > +00. From now 

on, the convergence e — > will always be understood along the sequence (ek)k>i- According to 
Lemma A. 4, for all compact subset D C Qt, 



/ hdsdx = lim / \d x gf. Jdsdx < C 
Jd e-*>JD 



D 

so that 

hdsdx < C. 



Furthermore, on G& one has B^ e — > a.e. when e — > 0, and \\B°. e ||oo < A'(J). Since F G 
(L 1 n L°°)(Q t ) C L 2 (Q t ), by dominated convergence one deduces that 1g 5 \F(B^ — B)\ converges 
strongly in L 2 (Q t ) to l Gs \FB\. Finally 

lim / \F(M . ~ B)d x gi Adsdx = f \FB\hdsdx < CS. 
e ^ JGs Jg 5 

We now turn to the integral on Fg. By the Cauchy-Schwarz inequality, 
[ \F(B^ ie -B)8 x g s Vte \dsdx<( f |F|(fl£ e - Bfdsdx) ( [ ^(a^^dsd^ 

J Fs \Jf s J \JFs 

Owing to Lemma A. 3 and the boundedness of F, 

K(5) 

\ r \ K u xy v e) u *' ux ' ^ ^ -T- 

'F 5 



[ \F\{d x gl e ) 2 dsdx<C 

J Fx 



and by construction, B ^ B a.e. in Fg when e — > 0, while ||-B* e ||oo < tC(5). By dominated 
convergence, on concludes that for fixed i] and S, limsup^Q I/3I < CS. 
Combining the previous estimates, let us now write 

hmsup |/| < Crj ( C +X®) 1/a (^+(°) ^ + CS 



and conclude by taking consecutively 77 — > 0, a — > and S — > 0. □ 
It follows from Proposition A. 5 and (42) that 

(F 2 (x) - F}{x))f(s,x)dsdx = 0. 



/ 

J TO. 



'[0,T)xR 

Since / is arbitrary, Fj(x) — F 2 (x) a.e. in Q t , and this holds for all t £ [0,T). As a consequence, 
f 1 = F 2 in 7"(T). 

Appendix B. Proof of Proposition 2.3 

This appendix is dedicated to the computation of the time derivative of the flow t n- W^(F t , Gt) 
of the Wasserstein distance between two solutions F and G of the Cauchy problem (1) with 
respective initial conditions Fq and Go and classical regularity. In Subsection B.l, we gather some 
tail estimates on the solutions F t and G t as well as their space derivatives. In Subsection B.2, we 
use a new expression of W^(F tl G t ) in terms of F t and G t to prove Proposition 2.3. 
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B.l. Tail estimates. In this subsection we are concerned with the asymptotic behaviour of F t (x) 
and d x F t (x) when |x| is large. 

Lemma B.l. Under the assumptions of Proposition 1.1, for allt > 0, there exists a finite constant 
C{t) > such that the function 1 1— > C(t) is nondecreasing and: 

< -C(t), l -F {2x - C(t)) < F t (x) < F (| + C(tj) + exp (~^y) , 
V* > C(t), - F Q (2x + C(t))} < 1 - F t (x) < 1 - F (| - C(tj) + exp (~^y) ■ 

Proof. Fix a finite T > 0. We will use the process (-Xt)tg[o,T] introduced in the proof of Lemma 1.9. 
Let t £ [0,T] and define d{t) := *||b||oo- Ki < -C*i(i), then 



F t (s)=P X + / 6(r,X r )dr+ / a(r, X r )dW r < x 



>¥[X + / a(r, X r )dW r <x-t\ 



>¥[Xo<2{x-t\\l\\ 00 ), 



<t 



a(r,X r )dW r 



<t 



a{r,X r )dW r 

X < 2x - C 2 (t) ) P(X < 2x- - C 3 (t)), 



where C 2 (t) := 2d(t) = 2t||&|| 00 . By Chebyshev's inequality, 



a(r,X r )dW r 

E 



<t\ 



> 1 - 



> 1 - 



a(r,X r )dW r 



X < 2x-C 2 (i) 
A < 2x-C 2 (t) 



(*[|&|[oo-^) S 



t\\a* 



WI^IU-^) 2 ' 



and the right-hand side is larger than 1/2 as soon as x < —C${t) :~ — (2t||cr 2 || 0O ) 1 / 2 . 
As far as the upper bound is concerned, for x < —C\(t), 



F t (x)=F[X + / b(r, X r )dr + / a(r, X r )dW r < x 



< P X + / a(r, X r )dW r <x + t\ 



X = y) m(dy) 



a(r,X r )dW r <x-y + t\ 



Let us fix x < —Ci(t). For xo £ R, let us split the integral in the right-hand side above in two 
parts, integrating respectively on (— oo,x ] and (xq,+oo). Then the first part can be bounded by 
-Fo^o); whereas for the second part the exponential Markov inequality yields, for all A > 0, 



a{r,X r )dW r <x-y + t\ 



X = y) <exp [Xix-y + tWbW^ + tX 2 



l^lloO 



and finally, F t (x) < F (x ) + cxp (\(x - x + t\\b\\oo) + tA 2 ||cr 2 ||oo/2). As soon as x > x + i||6||oo, 
optimizing this expression in A > yields 



F t (x) < F (x ) +cxp 



(x - XQ + tWlWco)' 
2i||a 2 || 0O 
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We now choose x_ = (x+t||6|| 00 )/2, thenx < and F (x )_= F ((x+t\\b\\ oo )/2) = F (x/2+C 4 (t)) 
with C 4 (i) := i||6||oo/2. Moreover, for x < -C 2 (t) = -2t\ 



I OO 1 



\2 



where C§(t) := 32£| |<r 2 1 |oo ■ We get the first part of the lemma by taking C(t) as the maximum of 
Ci (t) , . . . , C5 (t) , and the second part follows similarly. □ 

Assuming the classical regularity of F, we now derive some estimates on the probability density 
d x F t from the celebrated Aronson inequalities on the fundamental solution of a parabolic equation 
with divergence form operator [3] . Due to the possible dispersion of the initial condition, our upper 
bound contains an extra tail term in addition to the classical Gaussian term. 

Lemma B.2. Under the assumptions of Lemma 2.2, for all < t\ < t%, there exists a positive 
constant K > 0, depending on t\ and ti, such that for all (t,x) £ (^1,^2] X R> 

(47) Kit-t^ exp ("S) - M{x) - irwr* ( cxp {-K(Tto) + tl ( Ft 4) 

Proof. The assumptions of Lemma 2.2 together with Corollary 1.12 ensure that the nonlinear 
martingale problem of Subsection 1.2 has a unique weak solution P. We denote by X the associated 
nonlinear diffusion process. Let L(s, y; t, dx) = F(X t £ dx\ X s — y) be the transition probability of 
X. The generator 

Lf := l -a{F t {x))d 2 x f + h{F t {x))d x f 

is uniformly elliptic, and by the regularity assumptions on A, B and F , it rewrites in the divergence 
form 

Lf = \d x (a(F t {x))d x f) - (^a'(F t (x))d x F t (x) - &(#(*)) J d x f. 

Let < ti < t%. The assumption of boundedness of d x Ft(x) on [£1,^2] x K ensures that the 
coefficients of the latter form are bounded. Thus, owing to [3], T(s,y;t,dx) admits a density 
g(s,y;t,x) and there exist some positive constants 7,, ftj, i £ {1, 2}, depending on t\ and t%, such 
that for all t £ {ti,t 2 ], 

Ki ( (x — y) 2 \ , , K2 f (x — vY 

■ ex P ~ n i \ — 9(^1, y; t, x) < — — -—7775- exp 



Hence, p£ (x) < d x F t (x) < p 2 (x), where 



For all x > 0, 



while 



Likewise, for x < 0, 



while 



^ exp f- (g . *f . ) 



72(t-h) 

so that the upper bound of (47) holds for any K > k 2 V (472) 
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As far as the lower bound is concerned, there exist X- < < x+ such that F tl (x+) — F tl (x_ ) > 
1/2. Then for all x g R, 

Kl ( (ix + V -£_) + \x^ 2 

> n „ TTTTo ex P 



2(t-t 1 )V2 "V 7i(*-*0 
Now there exists X > large enough, depending on ki, 71 and x + V — such that for all iff, 

Ki / ((x+ V + |d) 2 \ 1 / Kx 2 
exp r > — exp 1 



2 V 7i(*-*i) / ~ K \ t-ti 

which results in the lower bound of (47). □ 

B.2. Proof of Proposition 2.3. We first give a general formula for the Wasserstein distance 
W P (F,G) between two cumulative distribution functions F and G. 

Lemma B.3. Let F and G be two cumulative distribution functions on R. Then, for all p > 1, 
(48) W*(F, G) = p(p - 1) f l {x<y} ([G(x) - F(y)}+ + [F(x) - G(y)} + ) (y - x) p - 2 dxdy. 

JR 2 

Proof. Let us split the right-hand side of (48) into two symmetric integrals in F and G. Thanks 
to the Fubini-Tonelli theorem, the first integral writes: 

l {x<y} [G(x) - F(y)] + p(p - l)(y - xf^dxdy 

1 {x<y,G(x)>F(y)} (J 1 {F( y )<u<G(x)}duj p(p - l)(y - x) p ~ 2 dxdy 
t{x<y;F( y )<u<G(x)}V(p - l)(v - x) p ~ 2 dxdydu. 



10 JR 2 

By the definition of the pseudo-inverse functions F _1 and G _1 , note that for all x,y £ R and 
u e (0, 1), F(y) < u if and only if y < F _1 (u) and G(x) > u if and only if x > G _1 (w). Thus, the 
right-hand side above rewrites 

-I 

1 {G- 1 («)<r< y <_F-i(t l )}P(p - - x) p ~ 2 dxdydu 

1 rF-^u) rF-^u) 

~L{G-i(u)<F-i(u)} / pip - l)(y - x) p ~ 2 dydxdu 

Jx=G- 1 (u) Jy=x 



1 

llG-HuXF-HuniF-'iu) - G~ 1 (u)) p du, 





and we conclude using the symmetry in F and G of the two integrals in the right-hand side 
of (48). □ 

We are now ready to complete the proof of Proposition 2.3. 

Proof of Proposition 2.3. For all t > 0, (48) yields 

(49) W p (F t , G t ) = pip - l)(I(Ft,G t ) + UG t ,F t )), 

where we define I{F t ,Gt) ■= J R2 ~L{x<y}[Gt{x) — F t iy)] + {y — x) p ~ 2 dxdy. The assumption that 
W p (Fq,Gq) < +00 combined with Proposition 2.1 ensures that both I(Ft,Gt) and I{Gt,Ft) are 
finite. 

For all M > 0, let us denote 

I M (F t ,G t ):= f l { _ M£x<y < M} [G t (x)-F t (y)] + (y-x)i>- 2 dxdy, 
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then by the monotone convergence theorem, lirriM-y+oo lAi(Ft, Gt) = I(F t ,Gt) < +00. Owing to 
the assumption of classical regularity on F and G, the function t n- lM(F t ,G t ) is C 1 on (0, +00) 
and for alH > 0. 



dt 



(50) 



I M (Ft,Gt)= / l{-M<x< y <M}l{G t (x)>F t (y)}(dtG t (x) - d t F t (y))(y - x) p 2 dxdy 

JR 2 

l{-M<x<y<M;G t (x)>F t (y)}dx (J^a(G t (x))d x G t {x) - B(G t {x))j (y - x) p ~ 2 dxdy 
L {-M<x<v<M-G t (x)>F t ( v )} d x (\a(F t {y))d x F t (y) - B(F t (y))j (y - x) p ~ 2 dxdy. 



Let us define iptt( x ) '■= M A F t X { G t{x)) and ip M (x) := (-M) V G' f r 1 (F t (x)). Then the first 
integral in the right-hand side of (50) rewrites 



1 W(y)<C(y)} f (V - x Y~ 2d * (\a{G t (x))d x G t (x) - B(G t {x))\ dxdy 



pM ry 

1 {Ft(y)<G t (y)} \ 
<y=—M Jx=ip M (y) 

and integrating by parts, we get 



(y - x) p ~ 2 d x (^a(G t (x))d x G t (x) - B{G t (x))j dx 

-(V - VM{y)) p - 2 (\a(G t (-M) V F t ( y ))d x G t (<p- M (y)) - B(G t (-M) V F t (y))\ 
(p - 2)(y - xf- 3 (^a{G t {x))d x G t {x) - B(G t {x))j dx. 



v 



Now 



MF t (y)<G t (y)} I (p-2)(y-xr- 3 (\a{G t {x))d x G t {x) - B{G t {x))) dxdy 

y=-M Jx= Vm ( v ) \* J 

M (\ \ rAi( x ) 

V,W<G,W} \^-a(G t (x))d x G t (x) - B(G t (x))j J (p- 2)(y - xf^dydx 

1 {F t (x)<G t (x)} (^a{Gt{x))d x G t {x) - B{G t {x))^j (9?^ (x) - x) p ~ 2 dydx, 



lx=-M 

so that the first integral in the right-hand side of (50) finally writes 

J-M 



1 {F t t 



■ t {x)<G t {x)}{{v + M {x) - xY~ 2 (±a(G t (x))d x G t (x) - B(G t (x)) 
{x - v- M {x)y-*(±a{G t {-M) V F t {x))d x G t {y- M {x)) - B(G t (-M) V F t (x)j) } 



dx 



whereas the second integral similarly writes 

-M 



1 {Fv 



«W<G t (x)}{te(x) - xf - 2 (±a(F t (M) A G t {x))d x F t { v + M {x)) - B(F t (M) A G t (x)) 
-(x- ip M (x)r- 2 (±a(F t (x))d x F t (x) - B(F t (x))) }dx. 
Hence, we deduce that for all < t\ < £2, 

f t2 d 

lM(Ft 2 ,G t2 ) - lM(F tl ,G tl ) = J —lAi{Ft,Gt)dt = J M + J M + J M , 
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where 



MF t{x) <G t{x) }{{<P + M {x) ~ xY~ 2 [B(F t (M) A G t (x)) - B(G t (x))] 

-M 

+ (x- V - M {x)y- 2 [B{G t {-M) V F t (x)) - B(F t (x))]}dxdt; 



1 



t 2 r M 



j2 m'-=o / 1 {F t (x)<G t (x)}{('fiiAx) - x) p 2 a{G t {x))d x G t (x) 

L Jti J-M 

+ (x- <p M (x)) p - 2 a(F t (x))d x F t (x)}dxdt; 



1 



t 2 r M 



Jm == - « / / l { F t (.)<G t (,)}{(^(^) ~ x) p ~ 2 a(F t (M) A G t (x))d x F t (cp+(x)) 

L Jti J-M 

+ (x - v M (x)f- 2 a(G t (-M) V F t (x))d x G t {cp u {x))}dxdt. 



Integral term J| f . Since B is C 1 on [0,1], 

l { F t (x)< Gt (x)}ll&IU{(^(x) - xf- 2 |F t (M) A G t {x) - G t (x)\ 

-M 

+ (x- <p- M {x)f- 2 \G t {-M) V F t (x) - F t (x)\}dxdt 

||&|| 00 (2JV/)P- 2 {[G i (x) - F t {M)]+ + [G t (-M) - F t (x)] + }dxdt 

-M 

< f 2 ||6|| 00 (2M)P- 1 {tl( J F t ,M) +tl(G u -M)}dt. 

By Lemma B.l, for all M > 2C(t 2 ), for all t G [ti,t 2 ], 

t\(F t ,M) < tl(F , M/2 - C(t 2 )) + cxp(- M 2 /C{t 2 )), 
tl(G t , —M) < tl(G , -M/2 + C{t 2 )) + exp(-M 2 /C(t 2 )), 

so that \J M \ —t when M — > +oo due to the tail assumption on Fq and Go- 
Integral term J 2 r By the monotone convergence theorem, 

lim J 2 M = \ f 2 I l {Ft{x) < Gt{x)} {(F-\G t (x)) - xy- 2 a{G t {x))d x G t {x) 
+ (x- G^{F t {x))Y- 2 a{F t {x))d x F t (x)}dxdt, 

and the limit is finite as 



It! 



l {Ft(x) < Gt{x )}{F t -\Gt{x)) - xf- 2 a{G t {x))d x G t {x)dxdt 



<IMIoc/ / ^{Gtix^-x^dxGt^dxdt 

= IHU C Wp 2 (F u G t )dt 
Jti 

< ||o||oo(*2 -ii)V^„ p r 2 2 (F ,G ) < +oo, 



due to Proposition 2.1 (we take the convention that W§(Ft, G t ) = !)• 
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Integral term jf 4 . Note that 

r*2 r M 

l{F t (x)<G t (*)}(vi(a:) ~ - 2 a(f t (M) A G t (x))^F t (^+ (i))da;dt 

'ti J-M 

L {F t {x)<G t (x);F t (M)<Gt{x )} (M-xr- 2 a(F t (M))d x F t (M)dxdt+ 

-M 



l { F t (.)<G t (,);f f (A/)>G fW }(i ;1 r i (G t (x))- a; f-^(G t (x))5^ t ( J F t - i (G t (x)))d a; di. 

ti J —AI 

According to Lemmas B.l and B.2, letting G := G(ii/2), for M > 4C, the first integral in the 
right-hand side of (51) is bounded by 



a 



r*2 

(2M)f" 1 5 :E F t (M)dt 

'ti 

< IWU f (exp (-^^y) +tl (F tl/2) f ) ) df 



and the right-hand side of the last inequality vanishes when Af — > +00, whereas the second integral 
in the right-hand side of (51) converges monotonically to 

/ 2 / lw^M^'tGtti)) -^- 2 a(G t (x))9 :r F t (F t - 1 (G t (x)))d a ;dt. 

The second term in Jj^ is similar. 

Conclusion. Taking the limit M —> +00 in the equality Im(Ft 2 ,Gt 2 ) — lM(Fti,Gti) — {Jm + 
Jm) = "^m now yields 

I{F t2 ,G t2 )~I{F tl ,G tl )- 1 - f 2 f l {Ft(:c) < Gt(:c)} {(F t - 1 (G t (x))-xr- 2 a (G t (.T))^G t ( a; ) 
+ (x - G f - 1 (F t (a;)))f- 2 a( J F 1 t (x))^F t (x)}d a; dt 

l{F f( x)<G t(;c )}(Fr 1 (G t (x))-.T)P- 2 a(G t ( a ;))a ;c F t (F- 1 (G t ( a ;)))d.Tdt 



2 



'ti 



„t 2 



+ / / l { F t (,)<G t (.)}(^-G t - 1 (F t ( a; ))F- 2 a(F(x))^G t (Gr 1 (F(x)))d a ;di 



"1 



The left-hand side of the equality above is finite and the integrands in both integrals of the right- 
hand side are nonnegative. Hence, all the integrals involved are absolutely convergent, and we 
deduce 



I(F t2 , G t2 ) — I(F tl ,G 



tit 



n 



' l { F t (x)<G t ^)}{(Ft~\G t (x)) - x)^ 2 (a(G t (x))(d x G t (x) - d^Ff 1 (G t (x)))) 



(x - Gt\F t (x))y- 2 (a(F t (x))(d x F t (x) - 8 x G t {G^ (F t (x)))) }dxdt. 
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By symmetry and using (49), we now conclude 
WP(F t2 ,G t2 )-WP(F tl7 G tl ) 



f 2 / {\F t -\G t (x)) - xr 2 (a(G t (x))(d x G t (x) - d^Ff 1 (G t (x)))) 

+ \x- Gt\F t (x))\ p - 2 (a(F t (x))(d x F t (x) - d x G t {G^ (F t (x))))}dxdt 

t { f \F t -\u) Gi\u)r 2 a{u) ( 1 - - 1 ) d u Gi\u)Au 
2 J tl Uo \duG t (u) d u F t (u)J 

+ I \Ff\u) - G^(u)r 2 a{u) ( 1 - 1 ) duF^^du) dt 
Jo \o u F t (u) d u G t {u)J J 



pip - 1) 



a(u)\F t (u)-G t (u)\ p — - — dudt; 

d u F t \u)d u G t \u) 



It! Jo 

which completes the proof. □ 
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